Nucleic Acids, Proteins, and Antibodies

[001] This application refers to a "Sequence Listing" that is provided only on electronic 
media in computer readable form pursuant to Administrative Instructions Section 
801(a)(i). The Sequence Listing forms a part of this description pursuant to Rule 5.2 
and Administrative Instructions Sections 801 to 806, and is hereby incorporated in its 
entirety. 

[002] The Sequence Listing is provided as an electronic file (PCOIOPCT_seqList.txt,
23,109,132 bytes in size, created on January 12, 2001) on four identical compact discs
(CD-R), labeled "COPY 1," "COPY 2," "COPY 3," and "CRF." The Sequence
Listing complies with Annex C of the Administrative Instructions, and may be viewed,
for example, on an IBM-PC machine running the MS-Windows operating system by
using the V viewer software, version 2000 (see World Wide Web URL:
http://www.fileviewer.com).

Field of the Invention 
[003] The present invention relates to novel nervous system related polynucleotides, the 
polypeptides encoded by these polynucleotides herein collectively referred to as 
"nervous system antigens," and antibodies that immunospecifically bind these
polypeptides, and the use of such nervous system polynucleotides, antigens, and
antibodies for detecting, treating, preventing and/or prognosing disorders of the
nervous system, including, but not limited to, the presence of cancers of the nervous 
system and metastases of nervous system cancers. More specifically, isolated nervous 
system nucleic acid molecules are provided encoding novel nervous system 
polypeptides. Novel nervous system polypeptides and antibodies that bind to these 
polypeptides are provided. Also provided are vectors, host cells, and recombinant and 
synthetic methods for producing human nervous system polynucleotides, polypeptides, 
and/or antibodies. The invention further relates to diagnostic and therapeutic methods
useful for diagnosing, treating, preventing and/or prognosing disorders related to the
nervous system, including cancers of the nervous system, and therapeutic methods for 
treating such disorders. The invention further relates to screening methods for 
identifying agonists and antagonists of polynucleotides and polypeptides of the 
invention. The invention further relates to methods and/or compositions for inhibiting 
or promoting the production and/or function of the polypeptides of the invention. 

Background of the Invention 

[004] The brain is the control center of the body, encoding such functions as the ability
to move, touch, taste, smell, hear, and see, for example. It reviews all stimuli, whether
from internal organs or the surface of the body, and generates a reaction, such as
movement of the limbs, adjustment of the rate at which internal organs function,
and/or alteration of mood. Stimuli and reactions are transmitted to and from the brain 
via the spinal cord, a collection of nerves encased within bony vertebrae. Both the 
brain and spinal cord are wrapped in three layers of tissue, collectively called the 
meninges, which provide cushioning and protection. Together, these components 
make up the central nervous system (CNS). 

[005] The human brain is subdivided into three major segments: the brain stem, 
midbrain, and forebrain. The brain stem is considered to be the seat of the "primitive 
brain". It comprises such structures as the medulla and cerebellum, which control 
basic functions like breathing, heart rate, and digestion and the coordination of the
senses and muscle movement, respectively. Many of these features are homologous 
across species. The midbrain controls many sensory and motor functions, including 
eye movement, and links the brain stem to such structures as the thalamus (for 
information relay) and hypothalamus (which is instrumental in regulating autonomic 
functions, like maintaining body temperature, regulating water balance, and
controlling sleep). The forebrain is associated with the "high-level" functions of 
complex organisms. This area includes specialized regions for the control of skilled 
motor behaviors (e.g., speech, mood, thought, and planning for the future),
interpretation of sensory input from the rest of the body, control of voluntary body 
movements, interpretation of vision, retrieval of long-term memories, recognition of 
familiar objects, and initiation of communication or action.
[006] Despite being encapsulated in the thick, hard bones of the skull, the brain is
susceptible to many kinds of injury. Common injuries resulting from head trauma
include herniation, edema, hematomas (subdural and epidural), amnesia, coma,
stupor, delirium, persistent or chronic vegetative state, concussion and post-concussion 
syndrome, cerebral contusions, damage to specific brain areas (e.g., the aphasias, 
apraxia, agnosia, and amnesia), and posttraumatic epilepsy. Bacteria and other 
infectious organisms can reach the CNS through the blood stream or by penetration
through an injury or surgery wound, leading to several serious diseases, such as
bacterial meningitis, Waterhouse-Friderichsen syndrome, chronic meningitis, viral
meningitis (e.g., lymphocytic choriomeningitis), bacterial meningitis (e.g., Haemophilus,
Listeria, meningococcal, pneumococcal, or meningeal tuberculosis), encephalitis,
encephalomyelitis, Hallervorden-Spatz syndrome, aseptic meningitis, parainfectious
encephalitis, subacute sclerosing panencephalitis, brain abscesses, AIDS dementia
complex, Japanese encephalitis, St. Louis encephalitis, tick-borne encephalitis, West
Nile Fever encephalitis, postencephalitic Parkinson disease, necrotizing hemorrhagic
encephalomyelitis, visna, cerebral malaria, neurosyphilis (e.g., tabes dorsalis),
subdural empyema, cysticercosis, schistosomiasis, echinococcosis, coenurosis,
cerebral toxoplasmosis, and prion diseases (e.g., Creutzfeldt-Jakob syndrome, bovine
spongiform encephalopathy, Gerstmann-Straussler syndrome, kuru, or scrapie). Other
brain diseases include hydrocephalus (e.g., Dandy-Walker syndrome or normal
pressure hydrocephalus), Rett syndrome, Reye's syndrome, pseudotumor cerebri,
intracranial tuberculoma, Zellweger syndrome, narcolepsy, cataplexy, and cerebellar 
diseases. The spinal cord is equally susceptible to injury and disease, which can result 
in cervical spondylosis, cysts, acute transverse myelitis, spinal hematoma, nerve root 
disorders (e.g., sciatica, spinal stenosis, and shingles), ruptured disk, and spinal cord 
compression. Together, this illustrates the relative frailty of the CNS. 
[007] The peripheral nervous system (PNS) includes all nerves outside the CNS: the 
cranial nerves that connect the head and face directly to the brain, the nerves that
connect the eyes and nose to the brain, and all the nerves that connect the spinal cord
to the rest of the body. The brain communicates with much of the body through the
thirty-one pairs of spinal nerves that emerge from the spinal cord. Each pair includes
one nerve at the front of the spinal cord, which carries information from the brain to
the muscles, and one nerve located at the back of the spinal cord, which carries
sensory information to the brain. Peripheral nerves are actually bundles of nerve fibers
- some of which are very small (less than 1/64 of an inch in diameter) and others are
quite large. Large fibers convey the messages that activate muscles (motor nerves) 
and the sensations of touch and position (sensory nerves), whereas small fibers 
convey sensations of pain and temperature and control the automatic functions of the 
body, such as heart rate and blood pressure (autonomic nerves). 
[008] Like the CNS, damage to the PNS results in several known disease states with 
effects seen throughout the body. Disorders of muscle stimulation include 
amyotrophic lateral sclerosis, progressive muscular atrophy, progressive bulbar palsy, 
Werdnig-Hoffmann disease, intermediate spinal muscular atrophy, infantile and
juvenile muscular atrophy, poliomyelitis and the post polio syndrome, primary lateral
sclerosis, Wohlfart-Kugelberg-Welander disease, and progressive pseudobulbar palsy.
Malfunction of the cranial nerves that lead directly from the brain to various parts of
the head also results in several known disorders, such as trigeminal neuralgia,
glossopharyngeal neuralgia, and Bell's palsy. Other diseases of the PNS include
plexus disorders (e.g., acute brachial neuritis), thoracic outlet syndromes,
mononeuropathy (e.g., carpal tunnel syndrome, leprosy, ulnar nerve palsy, radial nerve
palsy, and peroneal nerve palsy), multiple mononeuropathy, polyneuropathy (e.g.,
chronic polyneuropathy and diabetic neuropathy), Guillain-Barre syndrome, and
hereditary neuropathies (e.g., Charcot-Marie-Tooth disease and Dejerine-Sottas
disease). 

[009] Nerve cells are the fundamental elements of both the CNS and PNS. In total, there
are an estimated 100 billion neurons in a human body. While neurons are similar to
other cells of the body in their general organization, they also possess highly
specialized and unique features which are critical to the function of the nervous
system. Each neuron is comprised of four distinct regions: the cell body, a single
axon, dendrites, and axon terminals. The cell body contains the nucleus and other
organelles necessary for the life and functioning of the neuron. The dendrites are
processes that extend outward from the cell body and receive signals from sensory
organs or from other neurons. In the dendrites, incoming signals are converted to
electrical impulses and transmitted to the cell body for processing. A single axon
extends from the cell body and conducts information from the cell body to organs,
muscles, or other neurons. At the end of the axon is an array of axon termini. These
termini are the transmitting elements of a neuron. By means of these termini, an axon
is able to transmit information to the receptive surfaces (typically the dendrites or the
cell body) of other neurons or muscle cells.
[010] Other cellular structures crucial for neural transmission are the cytoskeletal fibers, 
including microtubules and neurofilaments, which run the length of the axon and 
function in transporting proteins, vesicles, and other macromolecules to the axon 
terminal. Additionally, some axons are surrounded by a myelin sheath made up of 
membranes from either oligodendrocyte cells (CNS) or Schwann cells (PNS). 
Myelinated axons conduct electrical impulses faster than unmyelinated ones of the
same diameter. Damage to the myelin sheath has been associated with several known
disease states, including multiple sclerosis, acute disseminated encephalomyelitis,
Canavan disease, diffuse cerebral sclerosis, encephalitis periaxialis, globoid cell
leukodystrophy, metachromatic leukodystrophy, allergic encephalomyelitis,
necrotizing hemorrhagic encephalomyelitis, progressive multifocal
leukoencephalopathy, central pontine myelinolysis, transverse myelinolysis,
neuromyelitis optica, scrapie, swayback, adrenoleukodystrophy,
adrenomyeloneuropathy, Leber's hereditary optic atrophy, and HTLV-associated 
myelopathy. 

[011] Contact between neurons occurs at a specialized site called a synapse. At this site,
the axon terminal from one neuron (the presynaptic cell) sends a signal to another
neuron (the postsynaptic cell). Synapses may be connected either electrically or 
chemically. An electrical synapse consists of gap junctions that directly connect two 
neurons. This allows electrical signals to pass unabated from the presynaptic to 
postsynaptic neuron. 

[012] The electrical signals are produced by temporary changes in the current flow into
and out of the cell. Ion channels embedded in the membrane regulate current flow by 
selectively regulating the passage of a specific ion or ions across the membrane. There 
are two types of ion channels found in neural membranes - gated and non-gated. Non- 
gated channels are always open and are not significantly influenced by changes in 
external factors. These ion channels primarily function in maintaining the resting
membrane potential, or electrical potential across the membrane, of the neuron. Gated
channels, in contrast, exist in two stable conformations - open and closed. Most gated 
channels are closed when the membrane is at its resting potential, and open when 
stimulated by external factors such as a change in membrane potential, ligand binding, 
or membrane stretch. Agonists, antagonists, and antibodies that bind to or block ion 
channels are extremely useful tools for studying brain function, which could lead to 
significant advances in understanding disease and the development of therapies. For 
example, tetrodotoxin (TTX), isolated from the poison sacs of the puffer fish,
selectively blocks the voltage-gated sodium channels necessary for producing an
excitatory electrical potential. This provides the researcher with a powerful tool for 
studying the effects of activity blockade on such processes as neural network 
development, learning and memory. 
[013] In chemical synapses, the axon termini of the presynaptic cell contain vesicles 
filled with a particular molecule (neurotransmitter). An electrical signal from the cell 
body travels down the axon to the axon termini, where it triggers the release of 
neurotransmitter from the vesicle by exocytosis. The neurotransmitter rapidly diffuses
across the synaptic cleft separating the presynaptic from the postsynaptic neuron. The
neurotransmitter then binds to receptors located on the dendrites of the postsynaptic
neuron, which opens ion channels and provokes a change in the cell's electric potential.
This change in electrical potential prompts further transmission of the signal. 
[014] Signal transmission between a neuron and muscle cell occurs via a similar 
mechanism. At the neuromuscular junction, axon termini reside adjacent to muscle 
cells within depressions formed in the motor end-plate. An electrical signal prompts 
the release of neurotransmitter from axon termini, which diffuses across the synaptic 
cleft and binds to receptors located on the surface of the muscle cell. Binding of
neurotransmitter provokes an electrical response that stimulates contraction of the 
muscle. Dysfunction of the neuromuscular junction plays a role in several 
neurological disorders. For example, in myasthenia gravis the immune system 
produces antibodies that attack the neurotransmitter receptors located on the muscle, 
preventing neurotransmitter binding and muscle contraction. Additionally, these 
antibodies can also be transferred from mother to child through the placenta, resulting
in a variation of the disease called neonatal myasthenia, whose symptoms typically
disappear shortly after birth. Other known neuromuscular junction disorders include
Eaton-Lambert syndrome and botulism. 

[015] Neurotransmitters comprise a diverse group of small molecules, such as
L-glutamine and acetylcholine, or peptides like enkephalin (McCance and Huether,
Pathophysiology, the Biologic Basis for Disease in Adults and Children, 2nd edition,
pp. 403-404 (1994)). Neurotransmitters are synthesized within the cell body of the
presynaptic neuron and transported to the axon termini in vesicles, where they reside
until exocytosed. The effects of neurotransmitters can be excitatory (e.g., initiation of
neuron stimulation) or inhibitory (e.g., to hyperpolarize the plasma membrane and 
inhibit signal transmission). Many neurotransmitters are capable of eliciting either an 
excitatory or inhibitory response, dependent on the number and type of receptors 
located on the postsynaptic neuron. 

[016] Current medical research efforts have identified a role for neurotransmitters and 
their receptors as targets of pharmacological agents aimed at controlling neurological
function. For example, sedatives, such as benzodiazepines and barbiturates, mimic the
effect of the neurotransmitter GABA, which is known to be the primary inhibitory
neurotransmitter in the CNS (Katzung, Basic and Clinical Pharmacology, 6th edition,
338-339 (1995)). The aberrant activity of neurotransmitters and their receptors has
been linked to a number of neurological disorders, including Alzheimer's disease,
Parkinson's disease, epilepsy, stroke, and myasthenia gravis (Planells-Cases et al.,
PNAS 90: 5057-5061 (1993)), identifying an important need for the discovery of novel
polypeptides, agonists, and antagonists corresponding to neurotransmitters.

[017] In adult humans, each neuron is connected to approximately ten thousand other
neurons (Tessier-Lavigne et al., Science 274: 1123-1133 (1996)). While the overall
program for determining which neurons should be connected together is under genetic
control, it is external stimuli from sensory neurons that are crucially important in
determining what network connections are actually made. To clarify, precise neural
wiring is not fully developed at birth, but only roughly approximates the final network
required to be fully functional. During embryonic development, neural connections are
initiated via the programmed extrusion of axons, tipped at the leading end with a
growth cone that is guided by molecular cues. Throughout post-natal development,
this coarse pattern of connections is refined based on specific interactions between the
organism and its environment - essentially, through learning. This process can be
modulated by normal and aberrant experiences, both having a more profound effect 
during early stages of postnatal development than in adulthood. 

[018] Several changes occur in the brain throughout aging. Gross changes include
decreases in brain weight, the production of certain proteins, and the total number of 
neurons in many brain regions. Additionally, there are age-related alterations in the 
synthesis and degradation of neurotransmitters and their receptors that are believed to 
cause some of the characteristics of senescence: changes in sleep patterns, mood, 
appetite, neuroendocrine functions, motor activity, and memory. While these changes 
are considered normal, a number of diseases have been identified that result from 
aberrant age-related changes, such as Alzheimer's disease, Parkinson's disease, 
Huntington's disease, Pick's disease, and dementia. 

[019] The discovery of new human nervous system associated polynucleotides, the
polypeptides encoded by them, and antibodies that immunospecifically bind these 
polypeptides, satisfies a need in the art by providing new compositions which are 
useful in the diagnosis, treatment, prevention and/or prognosis of disorders of the 
nervous system, including, but not limited to, neuropsychiatric disorders, 
neurodegenerative diseases, vascular disorders, developmental disorders, infections, 
and neoplastic disorders. 


Summary of the Invention 
[020] The present invention relates to novel nervous system related polynucleotides, the 
polypeptides encoded by these polynucleotides herein collectively referred to as
"nervous system antigens," and antibodies that immunospecifically bind these
polypeptides, and the use of such nervous system polynucleotides, antigens, and
antibodies for detecting, treating, preventing and/or prognosing disorders of the
nervous system, including, but not limited to, the presence of cancers of the nervous
system and metastases of cancers of the nervous system. More specifically, isolated 
nervous system nucleic acid molecules are provided encoding novel nervous system 
polypeptides. Novel nervous system polypeptides and antibodies that bind to these
polypeptides are provided. Also provided are vectors, host cells, and recombinant and
synthetic methods for producing human nervous system polynucleotides, polypeptides,
and/or antibodies. The invention further relates to diagnostic and therapeutic methods
useful for diagnosing, treating, preventing and/or prognosing disorders related to the 
nervous system, including cancers of the nervous system, and therapeutic methods for 
treating such disorders. The invention further relates to screening methods for 
identifying agonists and antagonists of polynucleotides and polypeptides of the 
invention. The invention further relates to methods and/or compositions for inhibiting
or promoting the production and/or function of the polypeptides of the invention. 

Detailed Description 

Tables 

[021] Table 1A summarizes some of the polynucleotides encompassed by the invention
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences
(contig identifier (Contig ID:) and contig nucleotide sequence identifier (SEQ ID
NO:X))) and further summarizes certain characteristics of these polynucleotides and the
polypeptides encoded thereby. The first column provides a unique clone identifier,
"Clone ID NO:Z", for a cDNA plasmid related to each nervous system associated
contig sequence disclosed in Table 1A. The second column provides a unique contig
identifier, "Contig ID:" for each of the contig sequences disclosed in Table 1A. The
third column provides the sequence identifier, "SEQ ID NO:X", for each of the contig
polynucleotide sequences disclosed in Table 1A. The fourth column, "ORF (From-To)",
provides the location (i.e., nucleotide position numbers) within the
polynucleotide sequence of SEQ ID NO:X that delineates the preferred open reading
frame (ORF) shown in the sequence listing and referenced in Table 1A as SEQ ID
NO:Y (column 5). Column 6 lists residues comprising predicted epitopes contained in
the polypeptides encoded by each of the preferred ORFs (SEQ ID NO:Y).
Identification of potential immunogenic regions was performed according to the
method of Jameson and Wolf (CABIOS, 4:181-186 (1988)); specifically, the Genetics
Computer Group (GCG) implementation of this algorithm, embodied in the program
PEPTIDESTRUCTURE (Wisconsin Package v10.0, Genetics Computer Group
(GCG), Madison, Wis.). This method returns a measure of the probability that a given
residue is found on the surface of the protein. Regions where the antigenic index score
is greater than 0.9 over at least 6 amino acids are indicated in Table 1A as "Predicted
Epitopes." In particular embodiments, nervous system associated polypeptides of the
invention comprise, or alternatively consist of, one, two, three, four, five or more of
the predicted epitopes described in Table 1A. It will be appreciated that depending on
the analytical criteria used to predict antigenic determinants, the exact address of the
determinant may vary slightly. Column 7, "Tissue Distribution", shows the expression
profile of tissue, cells, and/or cell line libraries which express the polynucleotides of
the invention. The first number in column 7 (preceding the colon) represents the
tissue/cell source identifier code corresponding to the code and description provided in
Table 4. Expression of these polynucleotides was not observed in the other tissues
and/or cell libraries tested. For those identifier codes in which the first two letters are
not "AR", the second number in column 7 (following the colon) represents the
number of times a sequence corresponding to the reference polynucleotide sequence
(e.g., SEQ ID NO:X) was identified in the tissue/cell source. Those tissue/cell source
identifier codes in which the first two letters are "AR" designate information
generated using DNA array technology. Utilizing this technology, cDNAs were
amplified by PCR and then transferred, in duplicate, onto the array. Gene expression
was assayed through hybridization of first strand cDNA probes to the DNA array.
cDNA probes were generated from total RNA extracted from a variety of different
tissues and cell lines. Probe synthesis was performed in the presence of ^^P dCTP,
using oligo(dT) to prime reverse transcription. After hybridization, high stringency
washing conditions were employed to remove non-specific hybrids from the array. The
remaining signal, emanating from each gene target, was measured using a
Phosphorimager. Gene expression was reported as Phosphor Stimulating
Luminescence (PSL) which reflects the level of phosphor signal generated from the
probe hybridized to each of the gene targets represented on the array. A local
background signal subtraction was performed before the total signal generated from
each array was used to normalize gene expression between the different
hybridizations. The value presented after "[array code]:" represents the mean of the
duplicate values, following background subtraction and probe normalization. One of
skill in the art could routinely use this information to identify normal and/or diseased
tissue(s) which show a predominant expression pattern of the corresponding
polynucleotide of the invention or to identify polynucleotides which show
predominant and/or specific tissue and/or cell expression. Column 8, "Cytologic
Band", provides the chromosomal location of polynucleotides corresponding to SEQ
ID NO:X. Chromosomal location was determined by finding exact matches to EST
and cDNA sequences contained in the NCBI (National Center for Biotechnology
Information) UniGene database. Given a presumptive chromosomal location, disease
locus association was determined by comparison with the Morbid Map, derived from
Online Mendelian Inheritance in Man (Online Mendelian Inheritance in Man,
OMIM™. McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins
University (Baltimore, MD) and National Center for Biotechnology Information,
National Library of Medicine (Bethesda, MD) 2000, World Wide Web URL:
http://www.ncbi.nlm.nih.gov/omim/). If the putative chromosomal location of the
query overlapped with the chromosomal location of a Morbid Map entry, an OMIM
identification number is provided in Table 1A, column 9 labeled "OMIM Disease
Reference(s)". A key to the OMIM reference identification numbers is provided in
Table 5.
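The epitope criterion stated in paragraph [021] (an antigenic index score greater than 0.9 over at least 6 amino acids) can be illustrated with a short sketch. This is not the GCG PEPTIDESTRUCTURE implementation and does not compute the Jameson-Wolf index itself; it assumes the per-residue scores are already available as a list, and it reads "over at least 6 amino acids" as "every residue in a contiguous run exceeds the threshold", which is one possible interpretation. The function name and parameters are hypothetical.

```python
from typing import List, Tuple


def predicted_epitope_regions(
    scores: List[float], threshold: float = 0.9, min_len: int = 6
) -> List[Tuple[int, int]]:
    """Return 1-based, inclusive (start, end) residue ranges.

    A range is reported when every score in a contiguous run is strictly
    greater than `threshold` and the run spans at least `min_len` residues.
    (Assumed reading of the criterion; the original program may differ.)
    """
    regions: List[Tuple[int, int]] = []
    run_start = None  # 0-based index where the current run began
    for i, score in enumerate(scores):
        if score > threshold:
            if run_start is None:
                run_start = i
        else:
            if run_start is not None and i - run_start >= min_len:
                regions.append((run_start + 1, i))  # convert to 1-based
            run_start = None
    # Close a run that extends to the end of the sequence.
    if run_start is not None and len(scores) - run_start >= min_len:
        regions.append((run_start + 1, len(scores)))
    return regions
```

As noted in the text, the exact address of a predicted determinant depends on the analytical criteria, so changing `threshold` or `min_len` in such a sketch would shift the reported ranges.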

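The array processing described in paragraph [021] (local background subtraction, normalization by the total signal of each array, and averaging of duplicate spots) can be sketched as follows. This is a minimal illustration under stated assumptions, not the procedure actually used: the data layout, the function name, the clamping of negative background-corrected values to zero, and the optional `scale` factor are all hypothetical.

```python
from statistics import mean
from typing import Dict, List, Tuple


def normalize_array(
    spots: Dict[str, List[Tuple[float, float]]], scale: float = 1.0
) -> Dict[str, float]:
    """Compute one normalized expression value per gene target.

    `spots` maps a gene target to its duplicate spots, each given as
    (raw_signal, local_background) in PSL units.
    """
    # Step 1: subtract the local background from every spot.
    # Clamping at zero is an assumption, not stated in the source.
    corrected = {
        gene: [max(raw - background, 0.0) for raw, background in duplicates]
        for gene, duplicates in spots.items()
    }
    # Step 2: total background-corrected signal on this array.
    total = sum(value for values in corrected.values() for value in values)
    if total <= 0:
        raise ValueError("array has no signal above background")
    # Step 3: mean of the duplicates, expressed relative to the array total.
    return {gene: scale * mean(values) / total for gene, values in corrected.items()}
```

Dividing by the array total makes values from different hybridizations comparable, which is the stated purpose of the normalization; the choice of `scale` only changes the units of the reported value.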
[022] Table 1B summarizes additional polynucleotides encompassed by the invention
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences
(contig identifier (Contig ID:) and contig nucleotide sequence identifiers (SEQ ID NO:X)),
and genomic sequences (SEQ ID NO:B)). The first column provides a unique clone
identifier, "Clone ID NO:Z", for a cDNA clone related to each contig sequence. The 
second column provides the sequence identifier, "SEQ ID NO:X", for each contig 
sequence. The third column provides a unique contig identifier, "Contig ID:" for each 
contig sequence. The fourth column provides a BAC identifier "BAC ID NO:A" for
the BAC clone referenced in the corresponding row of the table. The fifth column
provides the nucleotide sequence identifier, "SEQ ID NO:B" for a fragment of the
BAC clone identified in column four of the corresponding row of the table. The sixth 
column, "Exon From-To", provides the location (i.e., nucleotide position numbers) 
within the polynucleotide sequence of SEQ ID NO:B which delineates certain
polynucleotides of the invention that are also exemplary members of polynucleotide
sequences that encode polypeptides of the invention (e.g., polypeptides containing
amino acid sequences encoded by the polynucleotide sequences delineated in column
six, and fragments and variants thereof).

[023] Table 2 summarizes homology and features of some of the polypeptides of the 
invention. The first column provides a unique clone identifier, "Clone ID NO:Z", 
corresponding to a cDNA disclosed in Table 1A. The second column provides the
unique contig identifier, "Contig ID:" corresponding to contigs in Table 1A and
allowing for correlation with the information in Table 1A. The third column provides
the sequence identifier, "SEQ ID NO:X", for the contig polynucleotide sequences. 
The fourth column provides the analysis method by which the homology/identity 
disclosed in the row was determined. Comparisons were made between polypeptides 
encoded by the polynucleotides of the invention and either a non-redundant protein 
database (herein referred to as "NR"), or a database of protein families (herein referred
to as "PFAM") as further described below. The fifth column provides a description of
PFAM/NR hits having significant matches to a polypeptide of the invention. Column
six provides the accession number of the PFAM/NR hit disclosed in the fifth column.
Column seven, "Score/Percent Identity", provides a quality score or the percent
identity, of the hit disclosed in column five. Columns 8 and 9, "NT From" and "NT
To" respectively, delineate the polynucleotides in "SEQ ID NO:X" that encode a
polypeptide having a significant match to the PFAM/NR database as disclosed in the
fifth column. In specific embodiments, polypeptides of the invention comprise, or
alternatively consist of, an amino acid sequence encoded by the polynucleotides in
SEQ ID NO:X as delineated in columns 8 and 9, or fragments or variants thereof.

[024] Table 3 provides polynucleotide sequences that may be disclaimed according to 
certain embodiments of the invention. The first column provides a unique clone 
identifier, "Clone ID NO:Z", for a cDNA clone related to nervous system associated 
contig sequences disclosed in Table 1A. The second column provides the sequence
identifier, "SEQ ID NO:X", for contig polynucleotide sequences disclosed in Table
1A. The third column provides the unique contig identifier, "Contig ID", for contigs
disclosed in Table 1A. The fourth column provides a unique integer 'a' where 'a' is
any integer between 1 and the final nucleotide minus 15 of SEQ ID NO:X, represented
as "Range of a", and the fifth column provides a unique integer 'b' where 'b' is any
integer between 15 and the final nucleotide of SEQ ID NO:X, represented as "Range
of b", where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:X, and where b is greater than or equal to a + 14. For each of the
polynucleotides shown as SEQ ID NO:X, the uniquely defined integers can be
substituted into the general formula of a-b, and used to describe polynucleotides which
may be preferably excluded from the invention. In certain embodiments, preferably 
excluded from the polynucleotides of the invention (including polynucleotide 
fragments and variants as described herein and diagnostic and/or therapeutic uses 
based on these polynucleotides) are at least one, two, three, four, five, ten, or more of 
the polynucleotide sequence(s) having the accession number(s) disclosed in the sixth 
column of this Table (including for example, published sequence in connection with a 
particular BAC clone). In further embodiments, preferably excluded from the
invention are the specific polynucleotide sequence(s) contained in the clones 
corresponding to at least one, two, three, four, five, ten, or more of the available 
material having the accession numbers identified in the sixth column of this Table 
(including for example, the actual sequence contained in an identified BAC clone).
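The constraints on the integers a and b in paragraph [024] can be made concrete with a small sketch. It assumes that "between" is inclusive at both ends and that positions are 1-based, as is conventional for sequence listings; the function names are hypothetical and are not part of the specification.

```python
def is_valid_a_b(a: int, b: int, seq_len: int) -> bool:
    """Check the stated constraints for a fragment described by the formula a-b.

    a: any integer from 1 to (final nucleotide - 15)
    b: any integer from 15 to the final nucleotide
    and b must be greater than or equal to a + 14.
    Inclusive bounds are an assumption.
    """
    return (
        1 <= a <= seq_len - 15
        and 15 <= b <= seq_len
        and b >= a + 14
    )


def fragment(seq: str, a: int, b: int) -> str:
    """Return nucleotides a through b (1-based, inclusive) of `seq`."""
    if not is_valid_a_b(a, b, len(seq)):
        raise ValueError(f"a={a}, b={b} violate the constraints for length {len(seq)}")
    return seq[a - 1 : b]
```

Because b is at least a + 14, every fragment described this way spans at least 15 nucleotides.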

[025] Table 4 provides a key to the tissue/cell source identifier code disclosed in Table
1A, column 7. Column 1 provides the key to the tissue/cell source identifier code
disclosed in Table 1A, Column 7. Columns 2-5 provide a description of the tissue or
cell source. Codes corresponding to diseased tissues are indicated in column 6 with
the word "disease". The use of the word "disease" in column 6 is non-limiting. The
tissue or cell source may be specific (e.g., a neoplasm), or may be disease-associated
(e.g., a tissue sample from a normal portion of a diseased organ). Furthermore, tissues
and/or cells lacking the "disease" designation may still be derived from sources
directly or indirectly involved in a disease state or disorder, and therefore may have a
further utility in that disease state or disorder. In numerous cases where the tissue/cell
source is a library, column 7 identifies the vector used to generate the library. 

[026] Table 5 provides a key to the OMIM™ reference identification numbers disclosed 
in Table 1A, column 9. OMIM reference identification numbers (Column 1) were 
derived from Online Mendelian Inheritance in Man (Online Mendelian Inheritance in 
Man, OMIM™. McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins 
University (Baltimore, MD) and National Center for Biotechnology Information, 
National Library of Medicine, (Bethesda, MD) 2000. World Wide Web URL: 
http://www.ncbi.nlm.nih.gov/omim/). Column 2 provides diseases associated with the 
cytologic band disclosed in Table 1A, column 8, as determined from the Morbid Map 
database. 

[027] Table 6 summarizes ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. 

[028] Table 7 shows the cDNA libraries sequenced, tissue source description, vector 
information and ATCC designation numbers relating to these cDNA libraries. 

[029] Table 8 provides a physical characterization of clones encompassed by the 
invention. The first column provides the unique clone identifier, "Clone ID NO:Z", 
for certain cDNA clones of the invention, as described in Table 1A. The second 
column provides the size of the cDNA insert contained in the corresponding cDNA 
clone. 

Definitions 

[030] The following definitions are provided to facilitate understanding of certain terms 
used throughout this specification. 

[031] In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. The 
term "isolated" does not refer to genomic or cDNA libraries, whole cell total or 
mRNA preparations, genomic DNA preparations (including those separated by 
electrophoresis and transferred onto blots), sheared whole cell genomic DNA 
preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide sequences of the present invention. 

[032] As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence encoding SEQ ID NO:Y or a fragment or variant thereof, a nucleic acid 
sequence contained in SEQ ID NO:X (as described in column 3 of Table 1A) or the 
complement thereof, a cDNA sequence contained in Clone ID NO:Z (as described in 
column 1 of Table 1A and contained within a library deposited with the ATCC); a 
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B or a fragment or variant thereof; or 
a nucleotide coding sequence in SEQ ID NO:B as defined in column 6 of Table 1B or 
the complement thereof. For example, the polynucleotide can contain the nucleotide 
sequence of the full length cDNA sequence, including the 5' and 3' untranslated 
sequences, the coding region, as well as fragments, epitopes, domains, and variants of 
the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a 
molecule having an amino acid sequence encoded by a polynucleotide of the invention 
as broadly defined (obviously excluding poly-Phenylalanine or poly-Lysine peptide 
sequences which result from translation of a polyA tail of a sequence corresponding to 
a cDNA). 

[033] As used herein, a "nervous system antigen" refers collectively to any 
polynucleotide disclosed herein (e.g., a nucleic acid sequence contained in SEQ ID 
NO:X or the complement thereof, or cDNA sequence contained in Clone ID NO:Z, or a 
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B, or a nucleotide coding sequence in 
SEQ ID NO:B as defined in column 6 of Table 1B or the complement thereof and 
fragments or variants thereof as described herein) or any polypeptide disclosed herein 
(e.g., an amino acid sequence contained in SEQ ID NO:Y, an amino acid sequence 
encoded by SEQ ID NO:X, or the complement thereof, an amino acid sequence 
encoded by the cDNA sequence contained in Clone ID NO:Z, an amino acid sequence 
encoded by SEQ ID NO:B, or the complement thereof, and fragments or variants 
thereof as described herein). These nervous system antigens have been determined to 
be predominantly expressed in nervous system tissues, including normal or diseased 
tissues (as shown in Table 1A, column 7 and Table 4). 

[034] In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone 
containing all or most of the sequence for SEQ ID NO:X is deposited at Human 
Genome Sciences, Inc. (HGS) in a catalogued and archived library. As shown, for 
example, in column 1 of Table 1A, each clone is identified by a cDNA Clone ID 
(identifier generally referred to herein as Clone ID NO:Z). Each Clone ID is unique to 
an individual clone and the Clone ID is all the information needed to retrieve a given 
clone from the HGS library. Furthermore, certain clones disclosed in this application 
have been deposited with the ATCC on October 5, 2000, having the ATCC 
designation numbers PTA 2574 and PTA 2575; and on January 5, 2001, having the 
depositor reference numbers TS-1, AC-1, and AC-2. In addition to the 
individual cDNA clone deposits, most of the cDNA libraries from which the clones 
were derived were deposited at the American Type Culture Collection (hereinafter 
"ATCC"). Table 7 provides a list of the deposited cDNA libraries. One can use the 
Clone ID NO:Z to determine the library source by reference to Tables 6 and 7. Table 
7 lists the deposited cDNA libraries by name and links each library to an ATCC 
Deposit. Library names contain four characters, for example, "HTWE." The name of 
a cDNA clone (Clone ID NO:Z) isolated from that library begins with the same four 
characters, for example "HTWEP07". As mentioned below, Table 1A correlates the 
Clone ID NO:Z names with SEQ ID NO:X. Thus, starting with an SEQ ID NO:X, one 
can use Tables 1A, 6 and 7 to determine the corresponding Clone ID NO:Z, which 
library it came from and which ATCC deposit the library is contained in. Furthermore, 
it is possible to retrieve a given cDNA clone from the source library by techniques 
known in the art and described elsewhere herein. The ATCC is located at 10801 
University Boulevard, Manassas, Virginia 20110-2209, USA. The ATCC deposits 
were made pursuant to the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for the purposes of patent procedure. 
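
As a minimal sketch of the naming convention described above, the library name can be read from the first four characters of a clone name and then used to look up the corresponding deposit. The function name and the lookup dictionary are hypothetical; the dictionary merely stands in for Table 7, and its value is a placeholder rather than an actual ATCC designation.

```python
def library_name(clone_id: str) -> str:
    """Return the four-character library name that begins a clone name."""
    if len(clone_id) <= 4:
        raise ValueError("a clone name is expected to be longer than its library name")
    return clone_id[:4]


# Hypothetical stand-in for Table 7; the value is a placeholder, not real deposit data.
library_to_deposit = {"HTWE": "<ATCC deposit listed in Table 7>"}

clone = "HTWEP07"                       # example clone name given in the text
library = library_name(clone)           # "HTWE"
deposit = library_to_deposit[library]   # look up the deposit containing that library
```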

[035] In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous 
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 
7.5 kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment, 
polynucleotides of the invention comprise a portion of the coding sequences, as 
disclosed herein, but do not comprise all or a portion of any intron. In another 
embodiment, the polynucleotides comprising coding sequences do not contain coding 
sequences of a genomic flanking gene (i.e., 5' or 3' to the gene of interest in the 
genome). In other embodiments, the polynucleotides of the invention do not contain 
the coding sequence of more than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 
1 genomic flanking gene(s). 

[036] A "polynucleotide" of the present invention also includes those polynucleotides 
capable of hybridizing, under stringent hybridization conditions, to sequences 
contained in SEQ ID NO:X, or the complement thereof (e.g., the complement of any 
one, two, three, four, or more of the polynucleotide fragments described herein), the 
polynucleotide sequence delineated in columns 8 and 9 of Table 2 or the complement 
thereof, and/or cDNA sequences contained in Clone ID NO:Z (e.g., the complement of 
any one, two, three, four, or more of the polynucleotide fragments, or the cDNA clone 
within the pool of cDNA clones deposited with the ATCC, described herein) and/or 
the polynucleotide sequence delineated in column 6 of Table IB or the complement 
thereof. "Stringent hybridization conditions" refers to an overnight incubation at 42 
degrees C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM 
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10% 
dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by 
washing the filters in 0.1x SSC at about 65 degrees C. 

[037] Also contemplated are nucleic acid molecules that hybridize to the polynucleotides 
of the present invention at lower stringency hybridization conditions. Changes in the 
stringency of hybridization and signal detection are primarily accomplished through 
the manipulation of formamide concentration (lower percentages of formamide result 
in lowered stringency), salt conditions, or temperature. For example, lower stringency 
conditions include an overnight incubation at 37 degrees C in a solution comprising 6X 
SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS, 
30% formamide, 100 µg/ml salmon sperm blocking DNA; followed by washes at 50 
degrees C with 1X SSPE, 0.1% SDS. In addition, to achieve even lower stringency, 
washes performed following stringent hybridization can be done at higher salt 
concentrations (e.g., 5X SSC). 
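
The salt concentrations implied by the buffer strengths recited above follow from simple linear scaling of the stated stock compositions. The following sketch is offered for illustration only; the helper name `scale` and the variable names are hypothetical, and the only inputs are the 5x SSC and 20X SSPE compositions given in the text.

```python
# Stock compositions as stated in the text, in mol/L.
SSC_5X = {"NaCl": 0.750, "trisodium citrate": 0.075}
SSPE_20X = {"NaCl": 3.0, "NaH2PO4": 0.2, "EDTA": 0.02}


def scale(stock: dict, stock_x: float, target_x: float) -> dict:
    """Scale each component linearly from a stock_x-strength to a target_x-strength buffer."""
    factor = target_x / stock_x
    return {name: conc * factor for name, conc in stock.items()}


wash_ssc = scale(SSC_5X, 5, 0.1)    # 0.1x SSC wash used after stringent hybridization
hyb_sspe = scale(SSPE_20X, 20, 6)   # 6X SSPE used for lower stringency hybridization
wash_sspe = scale(SSPE_20X, 20, 1)  # 1X SSPE used for the lower stringency wash
```

On this arithmetic, 0.1x SSC corresponds to about 15 mM NaCl and 1.5 mM trisodium citrate, and 6X SSPE corresponds to about 0.9 M NaCl, 0.06 M NaH2PO4, and 0.006 M EDTA.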

[038] Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress 
background in hybridization experiments. Typical blocking reagents include 
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and 
commercially available proprietary formulations. The inclusion of specific blocking 
reagents may require modification of the hybridization conditions described above, 
due to problems with compatibility. 

[039] Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as 
any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a 
complementary stretch of T (or U) residues, would not be included in the definition of 
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid 
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any 
double-stranded cDNA clone generated using oligo dT as a primer). 

[040] The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is a mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA that 
may be single-stranded or, more typically, double-stranded or a mixture of single- and 
double-stranded regions. In addition, the polynucleotide can be composed of triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. A polynucleotide 
may also contain one or more modified bases or DNA or RNA backbones modified for 
stability or for other reasons. "Modified" bases include, for example, tritylated bases 
and unusual bases such as inosine. A variety of modifications can be made to DNA 
and RNA; thus, "polynucleotide" embraces chemically, enzymatically, or 
metabolically modified forms. 

[041] The polypeptide of the present invention can be composed of amino acids joined to 
each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides 
may be modified by either natural processes, such as posttranslational processing, or 
by chemical modification techniques which are well known in the art. Such 
modifications are well described in basic texts and in more detailed monographs, as 
well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 
branched, for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins 
such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. 
Freeman and Company, New York (1993); POSTTRANSLATIONAL COVALENT 
MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York, 
pgs. 1-12 (1983); Seifter et al., Meth. Enzymol. 182:626-646 (1990); Rattan et al., 
Ann. N. Y. Acad. Sci. 663:48-62 (1992).) 

[042] "SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in 
Tables 1A or 2, while "SEQ ID NO:Y" refers to a polypeptide sequence described in 
column 5 of Table 1A. SEQ ID NO:X is identified by an integer specified in column 3 
of Table 1A. The polypeptide sequence SEQ ID NO:Y is a translated open reading 
frame (ORF) encoded by polynucleotide SEQ ID NO:X. "Clone ID NO:Z" refers to a 
cDNA clone described in column 1 of Table 1A. 

[043] "A polypeptide having biological activity" refers to a polypeptide exhibiting 
activity similar to, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency does 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 
more than about 25-fold less and, preferably, not more than about tenfold less activity, 
and most preferably, not more than about three-fold less activity relative to the 
polypeptide of the present invention). 
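
For illustration only, the relative activity thresholds recited above may be sketched as a simple comparison. The function name and the tier labels are hypothetical, and the sketch assumes that both activities are positive values measured in the same assay and that the "about" thresholds are applied as exact cut-offs, which the text itself does not require.

```python
def activity_tier(candidate_activity: float, reference_activity: float) -> str:
    """Classify a candidate's activity relative to a reference polypeptide.

    Assumes both values are positive and come from the same biological assay.
    """
    if candidate_activity <= 0 or reference_activity <= 0:
        raise ValueError("activities must be positive")
    if candidate_activity >= reference_activity:
        return "equal or greater activity"
    fold_less = reference_activity / candidate_activity
    if fold_less <= 3:
        return "most preferred (not more than three-fold less)"
    if fold_less <= 10:
        return "preferred (not more than tenfold less)"
    if fold_less <= 25:
        return "within range (not more than 25-fold less)"
    return "outside range (more than 25-fold less)"
```

For example, under these assumptions a candidate with one fifth of the reference activity would fall in the tenfold tier.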

[044] Table 1A summarizes some of the polynucleotides encompassed by the invention 
(including contig sequences (SEQ ID NO:X) and clones (Clone ID NO:Z)) and further 
summarizes certain characteristics of these polynucleotides and the polypeptides 
encoded thereby. 

Polynucleotides and Polypeptides 
TABLE 1A 
L0439: 4, S0010: 2, 
H0051: 1 and H0100: 1. 

S0010: 2 

S0010: 2 

S0010: 2 and L0748: 1. 

S0010: 2 

S0010: 2 

S0010: 3, L0803: 2 and 
L0809: 1. 

S0010: 1 and S0260: 1. 

S0010: 2 

S6026: 1, S0010: 1 and 
S0346: 1. 

S0010: 2 and L0766: 2. 

H0052: 3, S6024: 1, 
H0261: 1 and S0010: 1. 

H0051: 3, S0036: 2, 
S0035: 1, S0010: 1 and 
S0260: 1. 

S0010: 2 

S0010: 1, S0346: 1, 
L0770: 1 and L0790: 1. 

L0742: 3, L0769: 2, 
S6024: 1, S0222: 1, 

Pro-24 to Val-29, 
Asp-37 to His-43, 
Lys-58 to Asn-66. 

Ser-1 to Gly-7. 

Glu-1 to Lys-8, 
Asp-19 to Pro-30, 
Ile-33 to Cys-40. 

Pro-38 to Arg-50. 

Trp-1 to Trp-11, 
Asp-17 to Cys-24. 

Gln-6 to Trp-13. 

HAGCC42 

HAGCE06 

HAGCE07 

HAGCF46 

HAGCM32 

HAGCM64 

HAGCS70 

HAGCV72 

HAGCX13 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 1 and S0050: 1. 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 2 

S6024: 1, S0010: 1, 
L0744: 1 and L0439: 1. 

S0010: 2 

S0222: 2, S0010: 1, 
S0346: 1, L0438: 1 and 
L0439: 1. 

S0010: 2, L0438: 2, 
L0439: 2 and H0566: 1. 

S0010: 2 

Asn-19 to Ser-25, 
Gly-50 to Cys-57, 
Tyr-64 to Ser-70. 

Ser-2 to Gly-8. 

Ser-10 to Ser-28. 

Val-25 to Trp-32. 

Ile-7 to Asn-19. 

Trp-19 to Lys-33. 

Glu-7 to Gb-16, 
Arg-24 to Arg-31, 
Gly-52 to Gly-58. 

Val-17 to Lys-22, 
Glu-40 to Pro-52. 

Glu-6 to Val-12, 
Thr-14 to Val-22, 
Ala-31 to Pro-49. 

Leu-1 to His-8. 

Arg-13 to Phe-22, 
Asn-24 to Ala-31, 
Glu-48 to Tyr-60. 

HAGFE37 



WO 01/59063    PCT/US01/01334 

H0442: 1 and S0010: 1. 

S0010: 2 

S0010: 2 and H0051: 1. 

S0010: 3, L0770: 3, 
L0439: 3, L0438: 2, 
L0777: 2, L0630: 1, 
L0764: 1, L0794: 1, 
L0352: 1 and L0743: 1. 

S0346: 2 

L0439: 3, L0438: 2, 
H0229: 1, S0665: 1, 
S0346: 1, S0049: 1 and 
L0366: 1. 

S0346: 2 

S0346: 2 

S0010: 1 and S0346: 1. 

S0346: 1 and H0009: 1. 

S0346: 2 

L0005: 1, S0010: 1 and 
S0346: 1. 

L0439: 4, S0400: 1, 
S0346: 1, H0374: 1 and 
S0260: 1. 

S0346: 2 

S0346: 2 

L0439: 2, S0665: 1, 

Pro-20 to Asn-26. 

Pro-10 to Ser-15. 

His-38 to Ser-52. 

Phe-15 to Ser-24. 

Ala-5 to Leu-11. 

Gln-25 to Phe-32. 

Pro-22 to Gly-28, 
Gly-37 to Gb-42, 
Asn-71 to Tyr-82. 

Asn-38 to Asp-44. 

Pro-4 to Lys-29. 

Ser-12 to Gly-18, 
S0346: 1, L0352: 1 and 
L0366: 1. 

T0082: 1, H0009: 1, 
L0521: 1 and L0361: 1. 

S6026: 1 and T0082: 1. 

S0222: 1, T0082: 1 and 
H0009: 1. 

T0082: 2 

T0082: 2 

T0082: 1 and S0010: 1, 

AR061: 5, AR089: 3 
H0438: 1 and T0082: 
1. 

Thr-31 to Gly-39, 
His-46 to Arg-51, 
Ser-58 to Asp-85. 

Lys-52 to Ser-57. 

Leu-19 to Gly-29, 
Val-32 to Ser-39. 

Ala-1 to Ala-12, 
Arg-16 to Thr-23, 
Arg-36 to Ser-44. 

Lys-11 to Met-16, 
His-28 to Arg-33, 
Ala-65 to Asn-72. 
Phe-8 to Phe-15. 

Background of the Invention 

[004] The brain is the control center of the body, encoding such functions as the ability 
to move, touch, taste, smell, hear, and see, for example. It reviews all stimuli, whether 
from internal organs or the surface of the body, and generates a reaction, such as 
movement of the limbs, adjustment of the rate at which internal organs function, 
and/or alteration of mood. Stimuli and reactions are transmitted to and from the brain 
via the spinal cord, a collection of nerves encased within bony vertebrae. Both the 
brain and spinal cord are wrapped in three layers of tissue, collectively called the 
meninges, which provide cushioning and protection. Together, these components 
make up the central nervous system (CNS). 

[005] The human brain is subdivided into three major segments: the brain stem, 
midbrain, and forebrain. The brain stem is considered to be the seat of the "primitive 
brain". It comprises such structures as the medulla and cerebellum, which control 
basic functions like breathing, heart rate, and digestion and the coordination of the 
senses and muscle movement, respectively. Many of these features are homologous 
across species. The midbrain controls many sensory and motor functions, including 
eye movement, and links the brain stem to such structures as the thalamus (for 
information relay) and hypothalamus (which is instrumental in regulating autonomic 
functions, like maintaining body temperature, regulating water balance, and 
controlling sleep). The forebrain is associated with the "high-level" functions of 
complex organisms. This area includes specialized regions for the control of skilled 
motor behaviors (e.g., speech, mood, thought, and planning for the future), 
interpretation of sensory input from the rest of the body, control of voluntary body 
movements, interpretation of vision, retrieval of long-term memories, recognition of 
familiar objects, and initiation of communication or action. 

[006] Despite being encapsulated in the thick, hard bones of the skull, the brain is 
susceptible to many kinds of injury. Common injuries resulting from head trauma 
include herniation, edema, hematomas (subdural and epidural), amnesia, coma, 
stupor, delirium, persistent or chronic vegetative state, concussion and post-concussion 
syndrome, cerebral contusions, damage to specific brain areas (e.g., the aphasias, 
apraxia, agnosia, and amnesia), and posttraumatic epilepsy. Bacteria and other 
infectious organisms can reach the CNS through the blood stream or by penetration 
through an injury or surgery wound, leading to several serious diseases, such as 
bacterial meningitis, Waterhouse-Friderichsen syndrome, chronic meningitis, viral 
meningitis (e.g., lymphocytic choriomeningitis), bacterial meningitis (e.g., Haemophilus, 
Listeria, Meningococcal, pneumococcal, or meningeal tuberculosis), encephalitis, 
encephalomyelitis, Hallervorden-Spatz syndrome, aseptic meningitis, parainfectious 
encephalitis, subacute sclerosing panencephalitis, brain abscesses, AIDS dementia 
complex, Japanese encephalitis, St. Louis encephalitis, tick-borne encephalitis, West 
Nile fever encephalitis, postencephalitic Parkinson disease, necrotizing hemorrhagic 
encephalomyelitis, visna, cerebral malaria, neurosyphilis (e.g., tabes dorsalis), 
subdural empyema, cysticercosis, schistosomiasis, echinococcosis, coenurosis, 
cerebral toxoplasmosis, and prion diseases (e.g., Creutzfeldt-Jakob syndrome, bovine 
spongiform encephalopathy, Gerstmann-Straussler syndrome, kuru, or scrapie). Other 
brain diseases include hydrocephalus (e.g., Dandy-Walker syndrome or normal 
pressure hydrocephalus), Rett syndrome, Reye's syndrome, pseudotumor cerebri, 
intracranial tuberculoma, Zellweger syndrome, narcolepsy, cataplexy, and cerebellar 
diseases. The spinal cord is equally susceptible to injury and disease, which can result 
in cervical spondylosis, cysts, acute transverse myelitis, spinal hematoma, nerve root 
disorders (e.g., sciatica, spinal stenosis, and shingles), ruptured disk, and spinal cord 
compression. Together, these examples illustrate the relative frailty of the CNS. 

[007] The peripheral nervous system (PNS) includes all nerves outside the CNS: the 
cranial nerves that connect the head and face directly to the brain, the nerves that 
connect the eyes and nose to the brain, and all the nerves that connect the spinal cord 
to the rest of the body. The brain communicates with much of the body through the 
thirty-one pairs of spinal nerves that emerge from the spinal cord. Each pair includes 
one nerve at the front of the spinal cord, which carries information from the brain to 
the muscles, and one nerve located at the back of the spinal cord, which carries 
sensory information to the brain. Peripheral nerves are actually bundles of nerve fibers 
- some of which are very small (less than 1/64 of an inch in diameter) and others are 
quite large. Large fibers convey the messages that activate muscles (motor nerves) 
and the sensations of touch and position (sensory nerves), whereas small fibers 
convey sensations of pain and temperature and control the automatic functions of the 
body, such as heart rate and blood pressure (autonomic nerves). 

[008] As with the CNS, damage to the PNS results in several known disease states with 
effects seen throughout the body. Disorders of muscle stimulation include 
amyotrophic lateral sclerosis, progressive muscular atrophy, progressive bulbar palsy, 
Werdnig-Hoffmann disease, intermediate spinal muscular atrophy, infantile and 
juvenile muscular atrophy, poliomyelitis and the post-polio syndrome, primary lateral 
sclerosis, Wohlfart-Kugelberg-Welander disease, and progressive pseudobulbar palsy. 
Malfunction of the cranial nerves that lead directly from the brain to various parts of 
the head also results in several known disorders, such as trigeminal neuralgia, 
glossopharyngeal neuralgia, and Bell's palsy. Other diseases of the PNS include 
plexus disorders (e.g., acute brachial neuritis), thoracic outlet syndromes, 
mononeuropathy (e.g., carpal tunnel syndrome, leprosy, ulnar nerve palsy, radial nerve 
palsy, and peroneal nerve palsy), multiple mononeuropathy, polyneuropathy (e.g., 
chronic polyneuropathy and diabetic neuropathy), Guillain-Barre syndrome, and 
hereditary neuropathies (e.g., Charcot-Marie-Tooth disease and Dejerine-Sottas 
disease). 

[009] Nerve cells are the fundamental elements of both the CNS and PNS. In total, there 
are an estimated 100 billion neurons in a human body. While neurons are similar to 
other cells of the body in their general organization, they also possess highly 
specialized and unique features which are critical to the function of the nervous 
system. Each neuron is comprised of four distinct regions: the cell body, a single 
axon, dendrites, and axon terminals. The cell body contains the nucleus and other 
organelles necessary for the life and functioning of the neuron. The dendrites are 
processes that extend outward from the cell body and receive signals from sensory 
organs or from other neurons. In the dendrites, incoming signals are converted to 
electrical impulses and transmitted to the cell body for processing. A single axon 
extends from the cell body and conducts information from it to organs, 
muscles, or other neurons. At the end of the axon is an array of axon termini. These 
termini are the transmitting elements of a neuron. By means of these termini, an axon 
is able to transmit information to the receptive surfaces (typically the dendrites or the 
cell body) of other neurons or muscle cells. 

[010] Other cellular structures crucial for neural transmission are the cytoskeletal fibers, 
including microtubules and neurofilaments, which run the length of the axon and 
function in transporting proteins, vesicles, and other macromolecules to the axon 
terminal. Additionally, some axons are surrounded by a myelin sheath made up of 
membranes from either oligodendrocyte cells (CNS) or Schwann cells (PNS). 
Myelinated axons conduct electrical impulses faster than unmyelinated ones of the 
same diameter. Damage to the myelin sheath has been associated with several known 
disease states, including multiple sclerosis, acute disseminated encephalomyelitis, 
Canavan disease, diffuse cerebral sclerosis, encephalitis periaxialis, globoid cell 
leukodystrophy, metachromatic leukodystrophy, allergic encephalomyelitis, 
necrotizing hemorrhagic encephalomyelitis, progressive multifocal 
leukoencephalopathy, central pontine myelinolysis, transverse myelinolysis, 
neuromyelitis optica, scrapie, swayback, adrenoleukodystrophy, 
adrenomyeloneuropathy, Leber's hereditary optic atrophy, and HTLV-associated 
myelopathy. 

[011] Contact between neurons occurs at a specialized site called a synapse. At this site, 
the axon terminal from one neuron (the presynaptic cell) sends a signal to another 
neuron (the postsynaptic cell). Synapses may be connected either electrically or 
chemically. An electrical synapse consists of gap junctions that directly connect two 
neurons. This allows electrical signals to pass unabated from the presynaptic to 
postsynaptic neuron. 

[012] The electrical signals are produced by temporary changes in the current flow into 
and out of the cell. Ion channels embedded in the membrane regulate current flow by 
selectively regulating the passage of a specific ion or ions across the membrane. There 
are two types of ion channels found in neural membranes - gated and non-gated. Non- 
gated channels are always open and are not significantly influenced by changes in 
external factors. These ion channels primarily function in maintaining the resting
membrane potential, or electrical potential across the membrane, of the neuron. Gated
channels, in contrast, exist in two stable conformations - open and closed. Most gated
channels are closed when the membrane is at its resting potential, and open when
stimulated by external factors such as a change in membrane potential, ligand binding,
or membrane stretch. Agonists, antagonists, and antibodies that bind to or block ion
channels are extremely useful tools for studying brain function, which could lead to
significant advances in understanding disease and the development of therapies. For
example, tetrodotoxin (TTX), isolated from the poison sacs of the puffer fish,
selectively blocks the voltage-gated sodium channels necessary for producing an 
excitatory electrical potential. This provides the researcher with a powerful tool for 
studying the effects of activity blockade on such processes as neural network 
development, learning and memory. 
[013] In chemical synapses, the axon termini of the presynaptic cell contain vesicles 
filled with a particular molecule (neurotransmitter). An electrical signal from the cell 
body travels down the axon to the axon termini, where it triggers the release of 
neurotransmitter from the vesicle by exocytosis. The neurotransmitter rapidly diffuses 
across the synaptic cleft separating the presynaptic from the postsynaptic neuron. The
neurotransmitter then binds to receptors located on the dendrites of the postsynaptic
neuron, which opens ion channels and provokes a change in the cell's electric potential.
This change in electrical potential prompts further transmission of the signal. 
[014] Signal transmission between a neuron and muscle cell occurs via a similar 
mechanism. At the neuromuscular junction, axon termini reside adjacent to muscle 
cells within depressions formed in the motor end-plate. An electrical signal prompts 
the release of neurotransmitter from axon termini, which diffuses across the synaptic 
cleft and binds to receptors located on the surface of the muscle cell. Binding of 
neurotransmitter provokes an electrical response that stimulates contraction of the 
muscle. Dysfunction of the neuromuscular junction plays a role in several 
neurological disorders. For example, in myasthenia gravis the immune system 
produces antibodies that attack the neurotransmitter receptors located on the muscle, 
preventing neurotransmitter binding and muscle contraction. Additionally, these 
antibodies can also be transferred from mother to child through the placenta, resulting 
in a variation of the disease called neonatal myasthenia, whose symptoms typically 
disappear shortly after birth. Other known neuromuscular junction disorders include
Eaton-Lambert syndrome and botulism. 

[015] Neurotransmitters comprise a diverse group of small molecules, such as
L-glutamine and acetylcholine, or peptides like enkephalin (McCance and Huether,
Pathophysiology: The Biologic Basis for Disease in Adults and Children, 2nd edition,
pp. 403-404 (1994)). Neurotransmitters are synthesized within the cell body of the
presynaptic neuron and transported to the axon termini in vesicles, where they reside
until exocytosed. The effects of neurotransmitters can be excitatory (e.g., initiation of
neuron stimulation) or inhibitory (e.g., to hyperpolarize the plasma membrane and
inhibit signal transmission). Many neurotransmitters are capable of eliciting either an 
excitatory or inhibitory response, dependent on the number and type of receptors 
located on the postsynaptic neuron. 

[016] Current medical research efforts have identified a role for neurotransmitters and 
their receptors as targets of pharmacological agents aimed at controlling neurological 
function. For example, sedatives, such as benzodiazepines and barbiturates, mimic the
effect of the neurotransmitter GABA, which is known to be the primary inhibitory
neurotransmitter in the CNS (Katzung, Basic and Clinical Pharmacology, 6th edition,
338-339 (1995)). The aberrant activity of neurotransmitters and their receptors has
been linked to a number of neurological disorders, including Alzheimer's disease,
Parkinson's disease, epilepsy, stroke, and myasthenia gravis (Planells-Cases et al.,
PNAS 90: 5057-5061 (1993)), identifying an important need for the discovery of novel
polypeptides, agonists, and antagonists corresponding to neurotransmitters.

[017] In adult humans, each neuron is connected to approximately ten thousand other
neurons (Tessier-Lavigne et al., Science 274: 1123-1133 (1996)). While the overall
program for determining which neurons should be connected together is under genetic
control, it is external stimuli from sensory neurons that are crucially important in
determining what network connections are actually made. To clarify, precise neural
wiring is not fully developed at birth, but only roughly approximates the final network
required to be fully functional. During embryonic development, neural connections are
initiated via the programmed extension of axons, tipped at the leading end with a
growth cone that is guided by molecular cues. Throughout post-natal development,
this coarse pattern of connections is refined based on specific interactions between the
organism and its environment - essentially, through learning. This process can be
modulated by normal and aberrant experiences, both having a more profound effect
during early stages of postnatal development than in adulthood.

[018] Several changes occur in the brain throughout aging. Gross changes include 
decreases in brain weight, the production of certain proteins, and the total number of 
neurons in many brain regions. Additionally, there are age-related alterations in the 
synthesis and degradation of neurotransmitters and their receptors that are believed to 
cause some of the characteristics of senescence: changes in sleep patterns, mood, 
appetite, neuroendocrine functions, motor activity, and memory. While these changes 
are considered normal, a number of diseases have been identified that result from
aberrant age-related changes, such as Alzheimer's disease, Parkinson's disease,
Huntington's disease, Pick's disease, and dementia.

[019] The discovery of new human nervous system associated polynucleotides, the 
polypeptides encoded by them, and antibodies that immunospecifically bind these 
polypeptides, satisfies a need in the art by providing new compositions which are 
useful in the diagnosis, treatment, prevention and/or prognosis of disorders of the 
nervous system, including, but not limited to, neuropsychiatric disorders, 
neurodegenerative diseases, vascular disorders, developmental disorders, infections, 
and neoplastic disorders. 


Summary of the Invention 
[020] The present invention relates to novel nervous system related polynucleotides, the 
polypeptides encoded by these polynucleotides herein collectively referred to as
"nervous system antigens," and antibodies that immunospecifically bind these 
polypeptides, and the use of such nervous system polynucleotides, antigens, and 
antibodies for detecting, treating, preventing and/or prognosing disorders of the 
nervous system, including, but not limited to, the presence of cancers of the nervous 
system and metastases of cancers of the nervous system. More specifically, isolated 
nervous system nucleic acid molecules are provided encoding novel nervous system 
polypeptides. Novel nervous system polypeptides and antibodies that bind to these 
polypeptides are provided. Also provided are vectors, host cells, and recombinant and
synthetic methods for producing human nervous system polynucleotides, polypeptides,
and/or antibodies. The invention further relates to diagnostic and therapeutic methods
useful for diagnosing, treating, preventing and/or prognosing disorders related to the
nervous system, including cancers of the nervous system, and therapeutic methods for
treating such disorders. The invention further relates to screening methods for
identifying agonists and antagonists of polynucleotides and polypeptides of the
invention. The invention further relates to methods and/or compositions for inhibiting
or promoting the production and/or function of the polypeptides of the invention. 

Detailed Description 

Tables 

[021] Table 1A summarizes some of the polynucleotides encompassed by the invention
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences
(contig identifier (Contig ID:) and contig nucleotide sequence identifier (SEQ ID
NO:X))) and further summarizes certain characteristics of these polynucleotides and the
polypeptides encoded thereby. The first column provides a unique clone identifier,
"Clone ID NO:Z", for a cDNA plasmid related to each nervous system associated
contig sequence disclosed in Table 1A. The second column provides a unique contig
identifier, "Contig ID:" for each of the contig sequences disclosed in Table 1A. The
third column provides the sequence identifier, "SEQ ID NO:X", for each of the contig
polynucleotide sequences disclosed in Table 1A. The fourth column, "ORF (From-To)",
provides the location (i.e., nucleotide position numbers) within the
polynucleotide sequence of SEQ ID NO:X that delineate the preferred open reading
frame (ORF) shown in the sequence listing and referenced in Table 1A as SEQ ID
NO:Y (column 5). Column 6 lists residues comprising predicted epitopes contained in
the polypeptides encoded by each of the preferred ORFs (SEQ ID NO:Y).
Identification of potential immunogenic regions was performed according to the
method of Jameson and Wolf (CABIOS, 4:181-186 (1988)); specifically, the Genetics
Computer Group (GCG) implementation of this algorithm, embodied in the program
PEPTIDESTRUCTURE (Wisconsin Package v10.0, Genetics Computer Group
(GCG), Madison, Wisc.). This method returns a measure of the probability that a given
residue is found on the surface of the protein. Regions where the antigenic index score
is greater than 0.9 over at least 6 amino acids are indicated in Table 1A as "Predicted
Epitopes." In particular embodiments, nervous system associated polypeptides of the
invention comprise, or alternatively consist of, one, two, three, four, five or more of
the predicted epitopes described in Table 1A. It will be appreciated that depending on
the analytical criteria used to predict antigenic determinants, the exact address of the
determinant may vary slightly. Column 7, "Tissue Distribution" shows the expression
profile of tissue, cells, and/or cell line libraries which express the polynucleotides of
the invention. The first number in column 7 (preceding the colon) represents the
tissue/cell source identifier code corresponding to the code and description provided in
Table 4. Expression of these polynucleotides was not observed in the other tissues
and/or cell libraries tested. For those identifier codes in which the first two letters are
not "AR", the second number in column 7 (following the colon) represents the
number of times a sequence corresponding to the reference polynucleotide sequence
(e.g., SEQ ID NO:X) was identified in the tissue/cell source. Those tissue/cell source
identifier codes in which the first two letters are "AR" designate information
generated using DNA array technology. Utilizing this technology, cDNAs were
amplified by PCR and then transferred, in duplicate, onto the array. Gene expression
was assayed through hybridization of first strand cDNA probes to the DNA array.
cDNA probes were generated from total RNA extracted from a variety of different
tissues and cell lines. Probe synthesis was performed in the presence of 33P dCTP,
using oligo(dT) to prime reverse transcription. After hybridization, high stringency
washing conditions were employed to remove non-specific hybrids from the array. The
remaining signal, emanating from each gene target, was measured using a
Phosphorimager. Gene expression was reported as Phosphor Stimulating
Luminescence (PSL) which reflects the level of phosphor signal generated from the
probe hybridized to each of the gene targets represented on the array. A local
background signal subtraction was performed before the total signal generated from
each array was used to normalize gene expression between the different
hybridizations. The value presented after "[array code]:" represents the mean of the
duplicate values, following background subtraction and probe normalization. One of
skill in the art could routinely use this information to identify normal and/or diseased
tissue(s) which show a predominant expression pattern of the corresponding
polynucleotide of the invention or to identify polynucleotides which show
predominant and/or specific tissue and/or cell expression. Column 8, "Cytologic
Band," provides the chromosomal location of polynucleotides corresponding to SEQ
ID NO:X. Chromosomal location was determined by finding exact matches to EST
and cDNA sequences contained in the NCBI (National Center for Biotechnology
Information) UniGene database. Given a presumptive chromosomal location, disease
locus association was determined by comparison with the Morbid Map, derived from
Online Mendelian Inheritance in Man (Online Mendelian Inheritance in Man,
OMIM™. McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins
University (Baltimore, MD) and National Center for Biotechnology Information,
National Library of Medicine (Bethesda, MD) 2000. World Wide Web URL:
http://www.ncbi.nlm.nih.gov/omim/). If the putative chromosomal location of the
Query overlapped with the chromosomal location of a Morbid Map entry, an OMIM
identification number is provided in Table 1A, column 9 labeled "OMIM Disease
Reference(s)". A key to the OMIM reference identification numbers is provided in
Table 5.
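
The epitope criterion stated in paragraph [021] (an antigenic index score greater than 0.9 over at least 6 amino acids) can be sketched as a simple run-finding scan. The Python below is a minimal illustration and is not the GCG PEPTIDESTRUCTURE program: the function name `predicted_epitopes`, the 1-based inclusive coordinates, and the example scores are assumptions, and the per-residue Jameson-Wolf scores are taken as already computed.

```python
from typing import List, Tuple


def predicted_epitopes(scores: List[float],
                       threshold: float = 0.9,
                       min_length: int = 6) -> List[Tuple[int, int]]:
    """Return 1-based inclusive (start, end) residue ranges in which every
    score is strictly greater than `threshold` for at least `min_length`
    consecutive residues."""
    regions = []
    run_start = None  # 0-based index at which the current run began
    for i, score in enumerate(scores):
        if score > threshold:
            if run_start is None:
                run_start = i
        else:
            # The run (if any) ended at index i - 1; keep it if long enough.
            if run_start is not None and i - run_start >= min_length:
                regions.append((run_start + 1, i))
            run_start = None
    # Handle a run that extends to the last residue.
    if run_start is not None and len(scores) - run_start >= min_length:
        regions.append((run_start + 1, len(scores)))
    return regions


# Hypothetical scores: a 7-residue run above 0.9, then a 4-residue run.
example_scores = [0.2] * 3 + [0.95] * 7 + [0.5] + [0.99] * 4
example_regions = predicted_epitopes(example_scores)
```

With these example scores only the seven-residue run (residues 4 to 10) qualifies; the four-residue run is too short under the stated criterion.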

[022] Table 1B summarizes additional polynucleotides encompassed by the invention
(including cDNA clones related to the sequences (Clone ID NO:Z), contig sequences
(contig identifier (Contig ID:) and contig nucleotide sequence identifiers (SEQ ID NO:X)),
and genomic sequences (SEQ ID NO:B)). The first column provides a unique clone
identifier, "Clone ID NO:Z", for a cDNA clone related to each contig sequence. The
second column provides the sequence identifier, "SEQ ID NO:X", for each contig
sequence. The third column provides a unique contig identifier, "Contig ID:" for each
contig sequence. The fourth column provides a BAC identifier "BAC ID NO:A" for
the BAC clone referenced in the corresponding row of the table. The fifth column
provides the nucleotide sequence identifier, "SEQ ID NO:B" for a fragment of the
BAC clone identified in column four of the corresponding row of the table. The sixth
column, "Exon From-To", provides the location (i.e., nucleotide position numbers)
within the polynucleotide sequence of SEQ ID NO:B which delineate certain
polynucleotides of the invention that are also exemplary members of polynucleotide
sequences that encode polypeptides of the invention (e.g., polypeptides containing
amino acid sequences encoded by the polynucleotide sequences delineated in column
six, and fragments and variants thereof). 

[023] Table 2 summarizes homology and features of some of the polypeptides of the
invention. The first column provides a unique clone identifier, "Clone ID NO:Z",
corresponding to a cDNA disclosed in Table 1A. The second column provides the
unique contig identifier, "Contig ID:" corresponding to contigs in Table 1A and
allowing for correlation with the information in Table 1A. The third column provides
the sequence identifier, "SEQ ID NO:X", for the contig polynucleotide sequences.
The fourth column provides the analysis method by which the homology/identity
disclosed in the row was determined. Comparisons were made between polypeptides
encoded by the polynucleotides of the invention and either a non-redundant protein
database (herein referred to as "NR"), or a database of protein families (herein referred
to as "PFAM") as further described below. The fifth column provides a description of
PFAM/NR hits having significant matches to a polypeptide of the invention. Column
six provides the accession number of the PFAM/NR hit disclosed in the fifth column.
Column seven, "Score/Percent Identity", provides a quality score or the percent
identity, of the hit disclosed in column five. Columns 8 and 9, "NT From" and "NT
To" respectively, delineate the polynucleotides in "SEQ ID NO:X" that encode a
polypeptide having a significant match to the PFAM/NR database as disclosed in the
fifth column. In specific embodiments, polypeptides of the invention comprise, or
alternatively consist of, an amino acid sequence encoded by the polynucleotides in
SEQ ID NO:X as delineated in columns 8 and 9, or fragments or variants thereof.

[024] Table 3 provides polynucleotide sequences that may be disclaimed according to 
certain embodiments of the invention. The first column provides a unique clone 
identifier, "Clone ID NO:Z", for a cDNA clone related to nervous system associated 
contig sequences disclosed in Table 1A. The second column provides the sequence
identifier, "SEQ ID NO:X", for contig polynucleotide sequences disclosed in Table
1A. The third column provides the unique contig identifier, "Contig ID", for contigs
disclosed in Table 1A. The fourth column provides a unique integer 'a' where 'a' is
any integer between 1 and the final nucleotide minus 15 of SEQ ID NO:X, represented
as "Range of a", and the fifth column provides a unique integer 'b' where 'b' is any
integer between 15 and the final nucleotide of SEQ ID NO:X, represented as "Range
of b", where both a and b correspond to the positions of nucleotide residues shown in
SEQ ID NO:X, and where b is greater than or equal to a + 14. For each of the
polynucleotides shown as SEQ ID NO:X, the uniquely defined integers can be
substituted into the general formula of a-b, and used to describe polynucleotides which
may be preferably excluded from the invention. In certain embodiments, preferably
excluded from the polynucleotides of the invention (including polynucleotide
fragments and variants as described herein and diagnostic and/or therapeutic uses
based on these polynucleotides) are at least one, two, three, four, five, ten, or more of
the polynucleotide sequence(s) having the accession number(s) disclosed in the sixth
column of this Table (including for example, published sequence in connection with a
particular BAC clone). In further embodiments, preferably excluded from the
invention are the specific polynucleotide sequence(s) contained in the clones
corresponding to at least one, two, three, four, five, ten, or more of the available
material having the accession numbers identified in the sixth column of this Table
(including for example, the actual sequence contained in an identified BAC clone).
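
The constraints on the integers a and b stated for Table 3 can be checked mechanically. The following Python sketch is an illustration only; the function names and the toy sequence are assumptions, and positions are treated as 1-based and inclusive, as in the general formula a-b.

```python
def is_valid_a_b(a: int, b: int, seq_length: int) -> bool:
    """Check the stated ranges for a sequence of `seq_length` nucleotides:
    1 <= a <= seq_length - 15, 15 <= b <= seq_length, and b >= a + 14."""
    return (1 <= a <= seq_length - 15
            and 15 <= b <= seq_length
            and b >= a + 14)


def fragment_a_b(sequence: str, a: int, b: int) -> str:
    """Return nucleotides a..b (1-based, inclusive) of `sequence`."""
    if not is_valid_a_b(a, b, len(sequence)):
        raise ValueError(f"a={a}, b={b} fall outside the stated ranges")
    return sequence[a - 1:b]


# Hypothetical 40-nucleotide stand-in for a sequence shown as SEQ ID NO:X.
toy_sequence = "ACGT" * 10
shortest = fragment_a_b(toy_sequence, 1, 15)  # spans positions 1 through 15
```

Because b must be at least a + 14, every fragment described by the formula a-b is at least 15 nucleotides long.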

[025] Table 4 provides a key to the tissue/cell source identifier code disclosed in Table
1A, column 7. Column 1 provides the key to the tissue/cell source identifier code
disclosed in Table 1A, Column 7. Columns 2-5 provide a description of the tissue or
cell source. Codes corresponding to diseased tissues are indicated in column 6 with
the word "disease". The use of the word "disease" in column 6 is non-limiting. The
tissue or cell source may be specific (e.g., a neoplasm), or may be disease-associated
(e.g., a tissue sample from a normal portion of a diseased organ). Furthermore, tissues
and/or cells lacking the "disease" designation may still be derived from sources
directly or indirectly involved in a disease state or disorder, and therefore may have a
further utility in that disease state or disorder. In numerous cases where the tissue/cell
source is a library, column 7 identifies the vector used to generate the library.

[026] Table 5 provides a key to the OMIM™ reference identification numbers disclosed
in Table 1A, column 9. OMIM reference identification numbers (Column 1) were
derived from Online Mendelian Inheritance in Man (Online Mendelian Inheritance in
Man, OMIM™. McKusick-Nathans Institute for Genetic Medicine, Johns Hopkins
University (Baltimore, MD) and National Center for Biotechnology Information,
National Library of Medicine, (Bethesda, MD) 2000. World Wide Web URL:
http://www.ncbi.nlm.nih.gov/omim/). Column 2 provides diseases associated with the
cytologic band disclosed in Table 1A, column 8, as determined from the Morbid Map
database.

[027] Table 6 summarizes ATCC Deposits, Deposit dates, and ATCC designation 
numbers of deposits made with the ATCC in connection with the present application. 

[028] Table 7 shows the cDNA libraries sequenced, tissue source description, vector 
information and ATCC designation numbers relating to these cDNA libraries. 

[029] Table 8 provides a physical characterization of clones encompassed by the 
invention. The first column provides the unique clone identifier, "Clone ID NO:Z", 
for certain cDNA clones of the invention, as described in Table 1A. The second
column provides the size of the cDNA insert contained in the corresponding cDNA 
clone. 

Definitions 

[030] The following definitions are provided to facilitate understanding of certain terms
used throughout this specification. 

[031] In the present invention, "isolated" refers to material removed from its original 
environment (e.g., the natural environment if it is naturally occurring), and thus is 
altered "by the hand of man" from its natural state. For example, an isolated 
polynucleotide could be part of a vector or a composition of matter, or could be 
contained within a cell, and still be "isolated" because that vector, composition of 
matter, or particular cell is not the original environment of the polynucleotide. The 
term "isolated" does not refer to genomic or cDNA libraries, whole cell total or 
mRNA preparations, genomic DNA preparations (including those separated by 
electrophoresis and transferred onto blots), sheared whole cell genomic DNA 
preparations or other compositions where the art demonstrates no distinguishing 
features of the polynucleotide sequences of the present invention. 

[032] As used herein, a "polynucleotide" refers to a molecule having a nucleic acid 
sequence encoding SEQ ID NO:Y or a fragment or variant thereof, a nucleic acid 
sequence contained in SEQ ID NO:X (as described in column 3 of Table 1A) or the
complement thereof, a cDNA sequence contained in Clone ID NO:Z (as described in
column 1 of Table 1A and contained within a library deposited with the ATCC); a
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in
SEQ ID NO:B as defined in column 6 of Table 1B or a fragment or variant thereof; or
a nucleotide coding sequence in SEQ ID NO:B as defined in column 6 of Table 1B or
the complement thereof. For example, the polynucleotide can contain the nucleotide
sequence of the full length cDNA sequence, including the 5' and 3' untranslated
sequences, the coding region, as well as fragments, epitopes, domains, and variants of
the nucleic acid sequence. Moreover, as used herein, a "polypeptide" refers to a
molecule having an amino acid sequence encoded by a polynucleotide of the invention
as broadly defined (obviously excluding poly-Phenylalanine or poly-Lysine peptide
sequences which result from translation of a polyA tail of a sequence corresponding to
a cDNA).
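
The exclusion of poly-Lysine and poly-Phenylalanine follows from the standard genetic code, in which AAA encodes lysine and TTT (UUU in RNA) encodes phenylalanine. The short Python sketch below illustrates this under simplifying assumptions: the codon table is deliberately limited to those two codons, any other codon is reported as "X", and all names are illustrative rather than taken from the specification.

```python
# Only the two codons relevant here are listed; both assignments come from
# the standard genetic code (K = lysine, F = phenylalanine).
HOMOPOLYMER_CODONS = {"AAA": "K", "TTT": "F"}
COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


def translate(dna: str) -> str:
    """Translate in the first reading frame; codons outside the small
    table above are shown as 'X'. Trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(HOMOPOLYMER_CODONS.get(dna[i:i + 3], "X")
                   for i in range(0, usable, 3))


poly_a_tail = "A" * 12                       # hypothetical 12-nt polyA tract
sense_product = translate(poly_a_tail)       # read from the polyA strand
antisense_product = translate(reverse_complement(poly_a_tail))  # polyT strand
```

Reading the polyA strand yields only lysine residues, and reading the complementary polyT strand yields only phenylalanine residues, which is why such peptides carry no sequence-specific information.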

[033] As used herein, a "nervous system antigen" refers collectively to any
polynucleotide disclosed herein (e.g., a nucleic acid sequence contained in SEQ ID
NO:X or the complement thereof, or cDNA sequence contained in Clone ID NO:Z, or a
nucleotide sequence encoding the polypeptide encoded by a nucleotide sequence in
SEQ ID NO:B as defined in column 6 of Table 1B, or a nucleotide coding sequence in
SEQ ID NO:B as defined in column 6 of Table 1B or the complement thereof and
fragments or variants thereof as described herein) or any polypeptide disclosed herein
(e.g., an amino acid sequence contained in SEQ ID NO:Y, an amino acid sequence
encoded by SEQ ID NO:X, or the complement thereof, an amino acid sequence
encoded by the cDNA sequence contained in Clone ID NO:Z, an amino acid sequence
encoded by SEQ ID NO:B, or the complement thereof, and fragments or variants
thereof as described herein). These nervous system antigens have been determined to
be predominantly expressed in nervous system tissues, including normal or diseased
tissues (as shown in Table 1A column 7 and Table 4).

[034] In the present invention, "SEQ ID NO:X" was often generated by overlapping 
sequences contained in multiple clones (contig analysis). A representative clone 
containing all or most of the sequence for SEQ ID NO:X is deposited at Human
Genome Sciences, Inc. (HGS) in a catalogued and archived library. As shown, for
example, in column 1 of Table 1A, each clone is identified by a cDNA Clone ID
(identifier generally referred to herein as Clone ID NO:Z). Each Clone ID is unique to
an individual clone and the Clone ID is all the information needed to retrieve a given
clone from the HGS library. Furthermore, certain clones disclosed in this application
have been deposited with the ATCC on October 5, 2000, having the ATCC
designation numbers PTA 2574 and PTA 2575; and on January 5, 2001, having the
depositor reference numbers TS-1, TS-2, AC-1, and AC-2. In addition to the
individual cDNA clone deposits, most of the cDNA libraries from which the clones
were derived were deposited at the American Type Culture Collection (hereinafter
"ATCC"). Table 7 provides a list of the deposited cDNA libraries. One can use the
Clone ID NO:Z to determine the library source by reference to Tables 6 and 7. Table
7 lists the deposited cDNA libraries by name and links each library to an ATCC
Deposit. Library names contain four characters, for example, "HTWE." The name of
a cDNA clone (Clone ID NO:Z) isolated from that library begins with the same four
characters, for example "HTWEP07". As mentioned below, Table 1A correlates the
Clone ID NO:Z names with SEQ ID NO:X. Thus, starting with an SEQ ID NO:X, one
can use Tables 1A, 6 and 7 to determine the corresponding Clone ID NO:Z, which
library it came from and which ATCC deposit the library is contained in. Furthermore, 
it is possible to retrieve a given cDNA clone from the source library by techniques 
known in the art and described elsewhere herein. The ATCC is located at 10801 
University Boulevard, Manassas, Virginia 20110-2209, USA. The ATCC deposits 
were made pursuant to the terms of the Budapest Treaty on the international 
recognition of the deposit of microorganisms for the purposes of patent procedure. 
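
The lookup chain described in paragraph [034] (SEQ ID NO:X to Clone ID NO:Z, to four-character library name, to ATCC deposit) can be sketched as follows. This is a minimal illustration: the function names and the dictionary layout are assumptions, and apart from the "HTWEP07" and "HTWE" examples given above, the table rows are placeholders rather than data from Tables 1A, 6, or 7.

```python
from typing import Dict, Tuple


def library_name(clone_id: str) -> str:
    """Return the four-character library prefix of a clone identifier."""
    if len(clone_id) < 4:
        raise ValueError(f"clone identifier too short: {clone_id!r}")
    return clone_id[:4]


def trace_deposit(seq_id: str,
                  seq_to_clone: Dict[str, str],
                  library_to_deposit: Dict[str, str]) -> Tuple[str, str, str]:
    """Follow a sequence identifier to its clone, library, and deposit."""
    clone_id = seq_to_clone[seq_id]          # Table 1A style lookup
    library = library_name(clone_id)         # first four characters
    deposit = library_to_deposit[library]    # Table 7 style lookup
    return clone_id, library, deposit


# Placeholder rows for illustration only.
seq_to_clone = {"SEQ ID NO:X": "HTWEP07"}
library_to_deposit = {"HTWE": "ATCC deposit (placeholder)"}
traced = trace_deposit("SEQ ID NO:X", seq_to_clone, library_to_deposit)
```

Keeping the prefix rule in its own function makes the single naming convention the text relies on explicit and easy to check.
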
[035] In specific embodiments, the polynucleotides of the invention are at least 15, at 
least 30, at least 50, at least 100, at least 125, at least 500, or at least 1000 continuous 
nucleotides but are less than or equal to 300 kb, 200 kb, 100 kb, 50 kb, 15 kb, 10 kb, 
7.5 kb, 5 kb, 2.5 kb, 2.0 kb, or 1 kb, in length. In a further embodiment,
polynucleotides of the invention comprise a portion of the coding sequences, as
disclosed herein, but do not comprise all or a portion of any intron. In another
embodiment, the polynucleotides comprising coding sequences do not contain coding
sequences of a genomic flanking gene (i.e., 5' or 3' to the gene of interest in the
genome). In other embodiments, the polynucleotides of the invention do not contain
the coding sequence of more than 1000, 500, 250, 100, 50, 25, 20, 15, 10, 5, 4, 3, 2, or 
1 genomic flanking gene(s). 
[036] A "polynucleotide" of the present invention also includes those polynucleotides
capable of hybridizing, under stringent hybridization conditions, to sequences 
contained in SEQ ID NO:X, or the complement thereof (e.g., the complement of any 
one, two, three, four, or more of the polynucleotide fragments described herein), the 
polynucleotide sequence delineated in columns 8 and 9 of Table 2 or the complement 
thereof, and/or cDNA sequences contained in Clone ID NO:Z (e.g., the complement of 
any one, two, three, four, or more of the polynucleotide fragments, or the cDNA clone 
within the pool of cDNA clones deposited with the ATCC, described herein) and/or 
the polynucleotide sequence delineated in column 6 of Table 1B or the complement
thereof. "Stringent hybridization conditions" refers to an overnight incubation at 42
degrees C in a solution comprising 50% formamide, 5x SSC (750 mM NaCl, 75 mM
trisodium citrate), 50 mM sodium phosphate (pH 7.6), 5x Denhardt's solution, 10%
dextran sulfate, and 20 µg/ml denatured, sheared salmon sperm DNA, followed by
washing the filters in 0.1x SSC at about 65 degrees C.

[037] Also contemplated are nucleic acid molecules that hybridize to the polynucleotides 
of the present invention at lower stringency hybridization conditions. Changes in the 
stringency of hybridization and signal detection are primarily accomplished through 
the manipulation of formamide concentration (lower percentages of formamide result 
in lowered stringency), salt conditions, or temperature. For example, lower stringency 
conditions include an overnight incubation at 37 degrees C in a solution comprising 6X
SSPE (20X SSPE = 3M NaCl; 0.2M NaH2PO4; 0.02M EDTA, pH 7.4), 0.5% SDS,
30% formamide, 100 µg/ml salmon sperm blocking DNA; followed by washes at 50
degrees C with 1X SSPE, 0.1% SDS. In addition, to achieve even lower stringency,
washes performed following stringent hybridization can be done at higher salt
concentrations (e.g., 5X SSC).
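
The buffer strengths quoted for the stringent and lower-stringency conditions above are multiples of a stock composition, so component concentrations at any strength follow by proportional scaling. The Python sketch below illustrates that arithmetic; the function name and dictionary layout are assumptions, and the only compositions used are the two stated in the text (5x SSC and 20X SSPE).

```python
from typing import Dict


def scale_buffer(reference: Dict[str, float],
                 reference_strength: float,
                 target_strength: float) -> Dict[str, float]:
    """Scale component concentrations from one buffer strength to another.
    Units are whatever units the reference composition uses."""
    factor = target_strength / reference_strength
    return {component: conc * factor for component, conc in reference.items()}


# Compositions as stated in the text.
SSC_5X_MILLIMOLAR = {"NaCl": 750.0, "trisodium citrate": 75.0}
SSPE_20X_MOLAR = {"NaCl": 3.0, "NaH2PO4": 0.2, "EDTA": 0.02}

wash_ssc = scale_buffer(SSC_5X_MILLIMOLAR, 5, 0.1)   # stringent wash, 0.1x SSC
hyb_sspe = scale_buffer(SSPE_20X_MOLAR, 20, 6)       # lower-stringency, 6X SSPE
wash_sspe = scale_buffer(SSPE_20X_MOLAR, 20, 1)      # lower-stringency wash, 1X
```

By this scaling, 0.1x SSC corresponds to 15 mM NaCl and 1.5 mM trisodium citrate, while 6X SSPE corresponds to 0.9 M NaCl, 0.06 M NaH2PO4, and 0.006 M EDTA. The much lower salt concentration of the stringent wash is what destabilizes imperfectly matched hybrids.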

[038] Note that variations in the above conditions may be accomplished through the 
inclusion and/or substitution of alternate blocking reagents used to suppress
background in hybridization experiments. Typical blocking reagents include
Denhardt's reagent, BLOTTO, heparin, denatured salmon sperm DNA, and
commercially available proprietary formulations. The inclusion of specific blocking
reagents may require modification of the hybridization conditions described above,
due to problems with compatibility. 
[039] Of course, a polynucleotide which hybridizes only to polyA+ sequences (such as
any 3' terminal polyA+ tract of a cDNA shown in the sequence listing), or to a
complementary stretch of T (or U) residues, would not be included in the definition of
"polynucleotide," since such a polynucleotide would hybridize to any nucleic acid
molecule containing a poly (A) stretch or the complement thereof (e.g., practically any
double-stranded cDNA clone generated using oligo dT as a primer). 

[040] The polynucleotide of the present invention can be composed of any 
polyribonucleotide or polydeoxyribonucleotide, which may be unmodified RNA or 
DNA or modified RNA or DNA. For example, polynucleotides can be composed of 
single- and double-stranded DNA, DNA that is a mixture of single- and double- 
stranded regions, single- and double-stranded RNA, and RNA that is a mixture of 
single- and double-stranded regions, hybrid molecules comprising DNA and RNA that 
may be single-stranded or, more typically, double-stranded or a mixture of single- and 
double-stranded regions. In addition, the polynucleotide can be composed of triple- 
stranded regions comprising RNA or DNA or both RNA and DNA. A polynucleotide 
may also contain one or more modified bases or DNA or RNA backbones modified for 
stability or for other reasons. "Modified" bases include, for example, tritylated bases 
and unusual bases such as inosine. A variety of modifications can be made to DNA 
and RNA; thus, "polynucleotide" embraces chemically, enzymatically, or 
metabolically modified forms. 

[041] The polypeptide of the present invention can be composed of amino acids joined to 
each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres, and may 
contain amino acids other than the 20 gene-encoded amino acids. The polypeptides 
may be modified by either natural processes, such as posttranslational processing, or 
by chemical modification techniques which are well known in the art. Such 
modifications are well described in basic texts and in more detailed monographs, as 
well as in a voluminous research literature. Modifications can occur anywhere in a 
polypeptide, including the peptide backbone, the amino acid side-chains and the amino 
or carboxyl termini. It will be appreciated that the same type of modification may be 
present in the same or varying degrees at several sites in a given polypeptide. Also, a 
given polypeptide may contain many types of modifications. Polypeptides may be 
branched, for example, as a result of ubiquitination, and they may be cyclic, with or 
without branching. Cyclic, branched, and branched cyclic polypeptides may result 
from posttranslation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, covalent 
attachment of flavin, covalent attachment of a heme moiety, covalent attachment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, 
covalent attachment of phosphatidylinositol, cross-linking, cyclization, disulfide bond 
formation, demethylation, formation of covalent cross-links, formation of cysteine, 
formation of pyroglutamate, formylation, gamma-carboxylation, glycosylation, GPI 
anchor formation, hydroxylation, iodination, methylation, myristoylation, oxidation, 
pegylation, proteolytic processing, phosphorylation, prenylation, racemization, 
selenoylation, sulfation, transfer-RNA mediated addition of amino acids to proteins 
such as arginylation, and ubiquitination. (See, for instance, PROTEINS - 
STRUCTURE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, W. H. 
Freeman and Company, New York (1993); POSTTRANSLATIONAL COVALENT 
MODIFICATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York, 
pgs. 1-12 (1983); Seifter et al., Meth. Enzymol. 182:626-646 (1990); Rattan et al., 
Ann. N.Y. Acad. Sci. 663:48-62 (1992).) 

[042] "SEQ ID NO:X" refers to a polynucleotide sequence described, for example, in 
Tables 1A or 2, while "SEQ ID NO:Y" refers to a polypeptide sequence described in 
column 5 of Table 1A. SEQ ID NO:X is identified by an integer specified in column 3 
of Table 1A. The polypeptide sequence SEQ ID NO:Y is a translated open reading 
frame (ORF) encoded by polynucleotide SEQ ID NO:X. "Clone ID NO:Z" refers to a 
cDNA clone described in column 1 of Table 1A. 

[043] "A polypeptide having biological activity" refers to a polypeptide exhibiting 
activity similar to, but not necessarily identical to, an activity of a polypeptide of the 
present invention, including mature forms, as measured in a particular biological 
assay, with or without dose dependency. In the case where dose dependency does 
exist, it need not be identical to that of the polypeptide, but rather substantially similar 
to the dose-dependence in a given activity as compared to the polypeptide of the 
present invention (i.e., the candidate polypeptide will exhibit greater activity or not 
more than about 25-fold less and, preferably, not more than about tenfold less activity, 
and most preferably, not more than about three-fold less activity relative to the 
polypeptide of the present invention). 
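
For illustration only, the fold-difference ranges recited above can be expressed as a small classification routine. The sketch treats the "about" limits as exact cut-offs of 3, 10, and 25, which is a simplifying assumption; the function name `activity_tier` and the tier labels are hypothetical.

```python
def activity_tier(candidate: float, reference: float) -> str:
    """Classify a candidate's activity relative to a reference polypeptide.

    Both values are activities measured in the same assay and must be positive.
    fold_less is how many times lower the candidate's activity is than the
    reference; values below 1 mean the candidate is more active.
    """
    if candidate <= 0 or reference <= 0:
        raise ValueError("activities must be positive")
    fold_less = reference / candidate
    if fold_less <= 3:
        return "most preferred"   # greater activity, or not more than 3-fold less
    if fold_less <= 10:
        return "preferred"        # not more than 10-fold less
    if fold_less <= 25:
        return "within range"     # not more than 25-fold less
    return "outside range"


if __name__ == "__main__":
    for cand in (200.0, 50.0, 10.0, 5.0, 1.0):
        print(cand, activity_tier(cand, reference=100.0))
```

A candidate with greater activity than the reference falls in the first tier, consistent with the statement that the candidate may exhibit greater activity.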

[044] Table 1A summarizes some of the polynucleotides encompassed by the invention 
(including contig sequences (SEQ ID NO:X) and clones (Clone ID NO:Z)) and further 
summarizes certain characteristics of these polynucleotides and the polypeptides 
encoded thereby. 

Polynucleotides and Polypeptides 
TABLE 1A 

L0439: 4, S0010: 2, 
H0051: 1 and H0100: 1. 

S0010: 2 

S0010: 2 

S0010: 2 and L0748: 1. 

S0010: 2 

S0010: 2 

S0010: 3, L0803: 2 and 
L0809: 1. 

S0010: 1 and S0260: 1. 

S0010: 2 

S6026: 1, S0010: 1 and 
S0346: 1. 

S0010: 2 and L0766: 2. 

H0052: 3, S6024: 1, 
H0261: 1 and S0010: 1. 

H0051: 3, S0036: 2, 
S0035: 1, S0010: 1 and 
S0260: 1. 

S0010: 2 

S0010: 1, S0346: 1, 
L0770: 1 and L0790: 1. 

L0742: 3, L0769: 2, 
S6024: 1, S0222: 1, 









Pro-24 to Val-29, 
Asp-37 to His-43, 
Lys-58 to Asn-66. 

Ser-1 to Gly-7. 

Glu-1 to Lys-8, 
Asp-19 to Pro-30, 
Ile-33 to Cys-40. 

Pro-38 to Arg-50. 

Trp-1 to Trp-11, 
Asp-17 to Cys-24. 


Gln-6 to Trp-13. 
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HAGCB09 

HAGCB32 

HAGCC42 

HAGCE06 

HAGCE07 

HAGCF46 

HAGCM32 

HAGCM64 

HAGCS70 

HAGCV72 

HAGCX13 
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600281, 
600281 















20q12-q13 

S0010: 2 

S0010: 1 and T0010: 1. 

S0222: 1, T0082: 1, 
S0010: 1 and S6028: 1. 

S0010: 1 and S0346: 1. 

S0010: 2, L0769: 2, 
L0803: 2, L0521: 1 and 
L0792: 1. 

L0747: 2, S6024: 1, 
S0010: 1, L0646: 1, 
L0766: 1 and L0665: 1. 

S0007: 1 and S0010: 1. 

S0010: 2 

S0010: 1 and S0346: 1. 

S0010: 2, L0753: 1 and 
L0592: 1. 

S0010: 2 

L0756: 2, S0010: 1, 
S0388: 1, S0036: 1, 
L0638: 1, L0438: 1, 
L0355: 1, L0439: 1 and 
L0740: 1. 

S0001: 1 and S0010: 1. 

S0010: 2 


Thr-12 to Arg-21. 

Asn-14 to Asn-21, 
Phe-26 to Ser-40. 

Pro-17 to His-22, 
Pro-27 to Gly-32, 
Glu-62 to Arg-72. 


Thr-22 to Tyr-31. 









His-1 to Ser-6, 
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HAGDV46 

HAGDX45 
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HAGEA58 
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HAGEL88 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 1 and S0050: 1. 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 2 

S0010: 2 

S6024: 1, S0010: 1, 
L0744: 1 and L0439: 1. 

S0010: 2 

S0222: 2, S0010: 1, 
S0346: 1, L0438: 1 and 
L0439: 1. 

S0010: 2, L0438: 2, 
L0439: 2 and H0566: 1. 

S0010: 2 

Asn-19 to Ser-25, 
Gly-50 to Cys-57, 
Tyr-64 to Ser-70. 

Ser-2 to Gly-8. 


Ser-10 to Ser-28. 

Val-25 to Trp-32. 


Ile-7 to Asn-19. 

Trp-19 to Lys-33. 

Leu-12 to Arg-17, 
Gly-24 to Arg-41. 

Glu-7 to Gln-16, 
Arg-24 to Arg-31, 
Gly-52 to Gly-58. 

Val-17 to Lys-22, 
Glu-40 to Pro-52. 

Glu-6 to Val-12, 
Thr-14 to Val-22, 
Ala-31 to Pro-49. 


Leu-1 to His-8. 

Arg-13 to Phe-22, 
Asn-24 to Ala-31, 
Glu-48 to Tyr-60. 

H0442: 1 and S0010: 1. 

S0010: 2 

S0010: 2 and H0051: 1. 

S0010: 3, L0770: 3, 
L0439: 3, L0438: 2, 
L0777: 2, L0630: 1, 
L0764: 1, L0794: 1, 
L0352: 1 and L0743: 1. 

S0346: 2 

L0439: 3, L0438: 2, 
H0229: 1, S0665: 1, 
S0346: 1, S0049: 1 and 
L0366: 1. 

S0346: 2 

S0346: 2 

S0010: 1 and S0346: 1. 

S0346: 1 and H0009: 1. 

S0346: 2 

L0005: 1, S0010: 1 and 
S0346: 1. 

L0439: 4, S0400: 1, 
S0346: 1, H0374: 1 and 
S0260: 1. 

S0346: 2 

S0346: 2 

L0439: 2, S0665: 1, 

Pro-20 to Asn-26. 

Pro-10 to Ser-15. 


His-38 to Ser-52. 


Phe-15 to Ser-24. 


Ala-5 to Leu-11. 

Gln-25 to Phe-32. 


Pro-22 to Gly-28, 
Gly-37 to Gln-42, 
Asn-71 to Tyr-82. 


Asn-38 to Asp-44. 


Pro-4 to Lys-29. 

Ser-12 to Gly-18, 

S0346: 1, L0352: 1 and 
L0366: 1. 

T0082: 1, H0009: 1, 
L0521: 1 and L0361: 1. 

S6026: 1 and T0082: 1. 

S0222: 1, T0082: 1 and 
H0009: 1. 

T0082: 2 

T0082: 2 

T0082: 1 and S0010: 1. 

AR061: 5, AR089: 3 
H0438: 1 and T0082: 

Thr-31 to Gly-39, 
His-46 to Arg-51, 
Ser-58 to Asp-85. 


Lys-52 to Ser-57. 


Leu-19 to Gly-29, 
Val-32 to Ser-39. 

Ala-1 to Ala-12, 
Arg-16 to Thr-23, 
Arg-36 to Ser-44. 

Lys-11 to Met-16, 
His-28 to Arg-33, 
Ala-65 to Asn-72. 
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601916 



























S0412: 2 

L0740: 3, L0439: 2, 
S0412: 2, L0600: 2, 
L0109: 1, L0065: 1, 
L0774: 1, L0776: 1, 
L0659: 1, L0664: 1, 
L0438: 1, L0779: 1 and 
L0777: 1. 

S0412: 2 

S0412: 4 

S0412: 5 

S0412: 2 and H0052: 1. 

S0412: 2 

S0412: 43, L0439: 4, 
S0414: 2, L0438: 2, 
S0378: 2, H0406: 1, 
S0222: 1, H0575: 1, 
S0051: 1, L0744: 1, 
L0747: 1, L0777: 1 and 
S0456: 1. 

S0412: 8, L0438: 1 and 
L0439: 1. 

S0412: 5 

S0412: 4 

S0412: 6, S0414: 4, 
L0439: 2, S0300: 1, 



Asp-44 to Arg-50. 



Glu-41 to Thr-46. 



Leu-25 to Asn-33, 
Glu-56 to Glu-63, 
Leu-72 to Glu-78. 

Phe-13 to Asp-20, 
Arg-25 to Met-32. 

Arg-1 to Gly-6. 
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S0414: 5 

S0414: 2 

S0414: 4 

S0414: 56 

S0414: 4 

S0414: 4 

S0414: 11 and S0114: 
1. 

S0414: 4 and L0592: 1. 

S0414: 3 

S0414: 9 


Pro-20 to Glu-27. 

GIy-12 to Lys-23, 
Glu-34 to Leu-43. 

Ala-3 to Gly-9, 
Glu-32 to Ile-40, 
Thr-77 to Gln-82. 

Ser-28 to His-35, 
Arg-73 to Trp-79. 




Ser-43 to Tyr-48, 
Thr-73 to Asp-79. 

Ghi-3 to Thr-9, 
Ser-22 to Thr-32, 
Thr-39 to Met-47, 
Pro-49 to Trp-54, 
Arg-56 to Lys-69. 
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H0052:landL0745: 1. 

S0414: 3 
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S0414: 25, L0756: 2, 
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Phe-44 to Leu-49. 
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S0414: 3 

T0126: 2 

H0374: 1 and H0052: 
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S0260: I. 
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Pro-19 to Gly-24. 
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Met-5 to Gin- 10, 
Thr-45 to Leu-62, 
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Asn-27 to Phe-39. 
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Val-31 to Glu-40. 

GIn-1 to Tyr-13. 
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L0805: 3, S0029: 2, 
L0764: 1 andL0766: 1. 

S0029: 2 

S0049: 1,H0052: 1, 
L0742: 1 andL0731: 1. 

S0049: 2 

S0049: 2, L0747: 1 and 
L0599: L i 

S0049: 2 

S6026: 1,S0049: 1 and 
L0756: 1. 

S0049: 2 

1 SOOlO: 1 andS0049: 1. 

TOOlO: 4, L0415:2, 
S0049:2, L0351:2, 
L0805:2, S6026; 1, 
H0618: 1, SOOlO: 1 and 
H0052: 1. 
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S0049: 2 and L0803: 1. 
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S0049: 2 

S0049: 2 

SOOOl: 1.S0049: 1 and 
L0439: 1. 
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S0049: 1, H0052: 1, 
L0I46: 1 andTOOlO: 1. 

S0007: 4, S0049: 1 and 
H0052: 1. 

S0049: 2 

H0434: 2 

H0434: 2 

H0006: 2 

H0310: 1 andS0388: 1. 

H0310: 1,S0051: 1, 
L0657: 1 andL0587: 1. 

H0310: 1, H0052: 1, 
L0438: 1 andL0439: 1. 

L0157: 5,L0805:2, 
L0756: 2, L0753: 2, 
S0412: 2, SOOlO: 1, 
H0310: 1,L0779: 1 and 
L0759: 1. 

H0229: 2 

H0229: 2 

L0439: 5, H0229: 1 
andTOOlO: 1. 

H0229: 2 

L0439:3andH0229: 
2. 



Glu-12toLys-17. 
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Asn-22 to Lys-27, 
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H0229: 2 
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AR089: 5,AR061: 2 
S0001:landS0021:l. 
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S0386: 1 andS0021: 1. 
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S0386: 2 

S0386: 2 
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S0386: 2 
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S0386: 

Pro-28 to Pro-43, 1 
Thr-48toArg-53, i 

Gln-14 to Asp-19. 
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His«22 to Arg-28. 


Lys-6 to Arg-16, 
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H0052: 1 andS0386: 1. 
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S0038: 2 
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S0038: 2 

S0036: 1 and S0038: 1. 
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L0742: 3, L0769: 2, 
S6024: 1, S0222: 1, 
H0438: 1,S0010: 1, 
H0052: 1,T0010:1, 


Lys-38 to Leu-47. 

Ser-21 to Asn-28. 
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Ser-14 to Asn-24, 
Gly-50 to Gly-62, 
Met-67 to His-76. 

Glu-1 to Met-10. 
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Pro-35 to Gly-43, 
Arg-71 to Lys-77. 


Tyr-1 to Thr-13, 
Ile-20 to Gln-29. 
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Ser-36 to Thr-41. 




3679 

3680 

3681 

3682 

3683 

3684 

3685 

3686 

3687 

3688 

3689 

3690 

3691 

3692 

3693 

3694 

66 - 206 

284 - 484 

3 - 260 

222 - 308 

101 - 226 

287 - 478 

2 - 205 

35 - 226 

101 - 1321 

706093 

782799 

813284 

682566 

706094 

701936 

744645 

925673 

919067 

952501 

963815 

953621 

953726 

823467 

766477 

872987 

HBWCK36 

HBWCK84 

HBWCL70 

HBWCM27 

HBWCN37 

HBWCO44 

HBWCO63 

HBWCO83 

HBWCP02 

HBWCR07 

HBWCR10 

HBXAC85 

HBXAG07 

HBXAN27 

HBXAR75 

HBXAS32 


WO 01/59063 


PCT/US01/01334 




















S0038: 1, L0768: 1, 
L0794: 1 and L0790: I. 

S0038: 2 
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Trp-1 to Arg-6, 
Ser-17toSer-24. 
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Pro-37 to Asn-43, 
Asn-53 to Leu-58. 
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S0049: 1. 

H0052: landS0038: 1. 

SOOlO: 1 andS0O38: 1. 

L0439: 3. H0052: 1, 
H0194: 1 andS0038: 1. 
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S0038: 2 

S0038: 2 

H0052: 1, S0038: 1 and 
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H0052: 1 and S0038: 1. 
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Pro-30 to Lys-35. 
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AR089: 4,AR061: 2 
H0438: 3 and S0260: 1, 

H0391: 1 andH0438: 
1. 

H0052: 2 


H0052: 2 

H0052: 2, S0346: 1 and 
L0366: 1. 

H0052: 2 and L0385: 
1. 

H0052: 2 

H0052: 2 

L0750: 2, H0052: 1 
and HOI 72: 1. 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 3 

H0052: 3 

H0052: 1 and HOI 94: 
1. 

H0052: 2 

S0049: 1 andH0052: 1. 


Arg-11 to Ser-20, 
Gly-24 to Ala-38, 
Ser-51 to Arg-57. 

Pro-32 to Pro-38, 
Leu-43 to Phe-54. 

Phe-19 to Ser-27. 


Arg-12 to Lys-25. 

GIy-1 to Arg-12, 
Gly-31 to Gln-39. 


Glu-18toArg-30. 

Pro-34 to Thr-48. 

Pro-1 to Asp-18. 

Leu-11 to Ser-16. 

Arg-27 to Gly-36. 


Gly-1 to Thr-9. 


Val-1 to Cys-6. 
Leu-23 to Phe-30. 
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S0282: 1 andH0052: 1. 

H0052:2 

H0052: 2 

H0052: 2 

H0261:l andH0052: 
1. . 
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H0052: 2 

SOOlO: 2,H0052: 1, 
TOOlO: 1 andL0594: 1. 

H0052: 2 

H0052: 6 

H0052: 2 

AR089: 5,AR061: 3 
SOOlO: 1 andH0052: 1. 


Pro-10 to Leu-17, 
Trp-43 to Glu-50. 

Asn-32 to Cys-44, 
Asn-51 to Asn-58. 



His-8 to Gly-21. 

Gly-11 to Pro-18, 
Gly-52 to Gly-57, 
Val-66 to Pro-73. 

Ser-7 to Gly-15, 
Ser-25 to Pro-32, 
Leu-48 to Gly-61, 
Thr-82 to Gln-88, 
Ser-97 to Asp-103. 

Pro-19 to Cys-27. 

His-57 to Leu-63. 

Ser-12 to Gly-17. 

Arg-10 to Asn-19, 
Gly-72 to Thr-77. 
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H0052: 2 

H0052: 1 andH0194: 
1. 

H0052:2 

AR061: 6,AR089: 3 
H0052: 2 

AR050: 82,AR054: 
52,AR051: 50 
H0052: 2 and S6014: 1. 




1 His-27 to Gly-35. 


Trp-1 toAla-11. 

Glu-31 to Gly-39, 
Ser-69 to Ser-74. 

Glu-36 to Asp-46. 
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AR061: 3,AR089: 1 
H0052:6,L0741:5, 
S0222: 3, S0031:3, 
S0036: 2,H0261: 1, 
H0441: 1, S0346: 1, 
S0049: 1, S0038: 1. 
H0522: 1, L0742: 1, 
L0752: 1, S0260: 1 and 
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Gly-1 to Arg-13. 
Thr-27 to Thr-34, 
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|L0796: 1,L0789: 1 and 
L0786: 1. 

H0052:2andH0201: 
2. 

H0052: 2 

H0052:2andH0194: 

H0052: 2, H0023:2, 
L0438: 1 andL0439: 1. 

H0052: 2 

H0052: 2 1 

H0052: 2 

H0052:2 

H0052: 3 

AR061: 6,AR089: 3 
L0439: 3, H0052: 2, 
L0758:2, S0222: 1, 
S0346: 1, S0388: 1 and 
L0792: 1. 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 8. H0341:l, 
30194: 1 andL0741: 1. 


Ser-3 to Phe-14. 

Arg-15 to GIy-22, 
Glu-25 to Arg-30. 

Val-25 to Phe-34. 

Pro-40 to Gly-48. 

Asn-23 to Phe-37. 

Gln-1 to Tyr-16. 

Gly-15toGlu-21. 



Pro-20 to Phe-38, 
Gly-74 to Gly-79. 

Gln-1 to Lys-9. 


Val-23 to Pro-43, 
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H0261: 1 ahdH0052: 
1. 

H0052: 1,H0051: I 
andLOSOO: 1. 

H0052: 2 

H0052: 3 and L0439; 
2. 
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Pro-41 to Asp-46, 
Ala-55 to Ser-64, 
Ser-69 to Gln-80. 
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H0052: 2 

H0052: 2 

H0261:2andH0052: 
1. 

H0052: 1 andH0201: 
1. 

H0052: 2 

H0052: 2 

H0052: 1 andS0036: 1. 

H0052: 3 

H0052: 2 

H0052: 2 

H0052:2andH0261: 
1. 

H0052: 1 and H0009: 
1. 

H0052: 2 

H0052: 2 

H0052: 1 and S0050: 1. 


Phe-1 to Ser-8, 
His-16toLeu-34. 




Ala-22 to Pro-27. 

Pro-12 to Ser-17. 


Arg-4 to Met-9. 


Cys-1 to Asn-8, 
His-31 toHis-36. 

Leu-1 to Thr-6, 
Tyr-30 to Ser-49, 
Ser-64 to Ala-70. 

Pro-6toThr-13, 
Pro-24 to Asp-35, 
Leu-68 to Tyr-75. 
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Thr-64 to VaI-77. 

1q21.2-q22 

H0052: 2 

|H0052:2 

H0052: 1 and S0386: 1. 

H0052: landH0194: 
1. 

H0052: 2 and L0748: 
1. 

H0052: 2 

H0052: 2 

H0052: 1,H0009: 1 
andL0769:l. 

H0052: 2 


Cys-15 to Ser-20. 

Gln-1 to Tyr-8, 
His-14toTrp-21, 
GIu-38 to Pro-47. 


Leu-20 to Arg-29. 

Pro-24 to GIu-34. 

Asn-1 to Ser-13, 
Gly-28 to GIy-36, 
Gln-38 to Asp-48, 
Thr-64 to Ser-69. 

Arg-27 to Trp-32, 
Arg-34 to Cys-39. 

Glu-5 to Ser-15. 
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HCECE07 

HCECE80 

HCECG77 
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HCECK37 

HCECM21 

HCECM96 
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H0052: 4 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2 

H0052:2 

H0261: 1,H0052: 1 
andH0201:l. 

H0052: 3 

H0052: 2 

H0052:2andH0261: 
1. 

L0439: 3, H0052: 2. 
L0758: 2, S0222: 1, 
S0346: 1, S0388: 1 and 


Lys-30 to Met-35, 
Ser-37 to Pro-44, 
Thr-56 to Pro-62, 
Met-94 to Gln-100. 

De-28 to Thr-34. 


Lys-6toTrp-13, 
Pro-15toLeu-25. 

Arg-1 to Lys-7. 


Pro-7 to Cys-23, 
Trp-92 to Ser-98. 



Arg-13to Gly-19, 
Thr-24 to Tyr-34. 

Gly-5 to Ser-13, 
Thr-25 to Asp-40. 


Arg-28 to Ser-33, 
Gln-44 to Asp-63. 
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L0792: 1. 

H0052:2 

1 H0052: 2 

S0222: 1 and H0052: 1. 

H0052: 2 

H0052: 1,H0567:1, 
L0754: 1, L0756: 1 and 
L0755: 1. 

H0052: 2 

S0007: 2, L0439: 2, 
L0024: 1,H0052: 1 and 
S0038: 1. 

H0052: 1,H0051: 1 
andL0593: 1. 

SOOlO: 1 andH0052: 1. 

H0052: 2 

H0052:2 

H0052: 2 

H0052: 2 

L0439: 3, H0052: 1, 
H0051: 1 andL0741: 1. 



Thr-40toLys-51. 

Arg-19toLys-36, 
Cys-50 to GIu-56, 
Leu-83 to Trp-90. 

Gln-1 toLeu-7, 
Cys-22 to Leu-27, 
Leu-35 to Asp-42, 
Gln-61 to Arg-66. 


His-7 to Arg-18, 
Gly-31toThr-36. 

AIa-21 toArg-33. 





Arg-2 to Leu-8. 

Ser-14toGly-21, 
GIn-41 to Gln-47. 

Pro-26 to Ser-35, 
Ser-48 to Leu-58, 

16p12-p13.1 


H0052: 2 

H0052:5andH0261: 
1. 

H0052: 2 

H0052: 2 

H0052: 2 

H0052: 2, L0439: 2 
andL0745:2. 

H0052: 2 

L0779:4,L0766:3, 
H0052: 2, L0794: 2, 
L0809: 1, L0752: 1 and 
L0758: 1. 

H0052: 2 

H0052: 3 

SOOlO: 1 andH0052: 1. 

1 Thr-73 to Thr-78. 


Pro-lltoIle-16. 

Thr-34toHis-39. 

Phe-1 to Thr-9. 


His-26 to Gly-32, 

Lys-6 to Ala-19. 


Pro-1 toLys^7. 
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H0052: 2 

H0052: 2 

H0052: 2 and L0777: 
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o 

AR061: 9,AR089: 6 
H0052: 2 and L0439: 
1. 

H0052; 2 and L0740: 


Thr-15toAsp-21, 
Asn-24 to Gln--29. 

Leu-6toTrp-18. 
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Arg-J2 to Lys-19. 
Gly-9 to Ala-16. 
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Ser-86 to Gly-94. 

Thr-77 to Asp-82, 
Ala-87 to Gly-94. 

Pro-3 to Arg-48, 
Arg-57 to Leu-63, 
Glu-82 to Gly-87, 
Thr-95 toAsn-107. 


Pro-15 to Gly-22. 
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H0052:3 and L0752: 
1. 

H0052: 2 

H0052: 2 and L0366: 
1- 

H0052:land H0123: 
1. 

H0052: 3,L0747: 2, 
S0222: 1 andL0763: 1. 

H0052:2and H0261: 
1. 

L0754: 3 and H0052: 
2. 

H0052:2andH0201: 
1. 

H0261: 1,S0222: 1, 
SOOlO: 1 andH0052: 1. 

H0052:2 

H0052:2, H0194: 1, 
H0201:l,L0764:land 
L0608: 1. 

H0052: 2 


Ser-12 to Ser-22, 
Cys-29 to Thr-34, 
Ser-64to His-69. 


Gln-1 to Gly-8. 





Pro-2toLys-14, 
Thr-34 to Ser-45, 
Met-77 to GIu-82, 
Val-92 to His-97. 

Gln-1 1 to Asp-16, 
Pro-38 to Phe-44. 

Cys-21 to Ser-26. 
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H0052: 3, L0787: 1 
andL0758: 1. 

AR050: 16,AR089: 2, 
AR061: 1 
H0052: 2, L0787: 1 
andL0758: 1. 

H0052: 2 

H0052: 12,L0741:3. 
H0194: 2, S0222: 1, 
S0038: 1,L0370: 1 and 
L0439: 1. 

L0439: 3, H0052: 2 
and S0007: 1.- 

H0052: 2 

H0052: 2 

SOOlO: 1 andH0052: 1. 

H0052:2andL0791: 
1. 

H0052: 2 

H0261: 1 andH0052: 
1. 

H0052: 2 

Arg-46 to Cys-63. 

Gln-33 to His-42, 
Pro-55 to Asn-66. 

Ala-5 to Leu-19, 

Gln-62 to His-71, 

Pro-84 to Ser-104, 

Ser-115 to Met-121, 

Tyr-136 to His-150, 

Pro-162 to Gly-169. 

Cys-13 to Lys-22. 

Leu-31 to Ser-36. 


Thr-1 to Ala-6. 

Arg-43 to Lys-56. 

Gly-13 to Gly-18, 
Gly-21 to Gln-28, 
Gly-35 to Lys-47, 
Ala-54 to Gln-63. 

Ala-1 to Arg-10. 



Pro-19 to His-24. 

22q12.3 

AR089: 1,AR061: 0 
H0052: 2 

SOOOl: 1 andH0052: l.j 

H0052: 2 

H0052: 2 

TOOlO: 2, H0052: 1 
andL0351:l. 

H0052: 1, S0038: 1 and 
L0439: 1. 

S6014: 1 andH0052: 1. 

H0052: 2 

H0052: 2 

H0052: 1 andH0194: 
1. 

S0038:2, L0439: 2, 
H0052: 1 andL0438: 1. 

H0052: 2 

H0052: 1,S0051: 1 and 
L0792: 1. 

H0052: 2 

L0617: 2,L0776: 2, 
S0049:l,H0052:land 
L0805: 1. 

Leu-47 to Arg-56, 
Pro-61 to Leu-67, 
Gly-95 to Leu-106. 

Ala-34 to His-39. 


Leu-28 to Thr-37. 

Ala-6toLys-13, 
Trp-40toGlu-50. 

Ser-30 to Gly-37. 

Thr-lOtoLeu-20. 

Asn-1 to Glu-63. 

Pro-1 to Ser-10. 


Pro-1 to Tyr-6. 


Arg-15toTrp-2L 

Arg-22 to Val-29, 
Pro-40 to Thr-56. 


4052 

4053 

4054 

4055 

4056 

4057 

4058 
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4062 
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4064 

4065 

4066 
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960925 

772669 

530889 

530828 

960518 

964940 

572559 

676423 

575404 

526220 

524100 

739598 

667623 

685271 

585360 

HCEMO85 

HCEMP77 

HCEMR37 

HCEMV51 

HCEMZ34 

HCEMZ61 

HCENA26 

HCENB23 

HCENC09 

HCENC60 

HCENJ44 

HCENJ59 

HCENL16 

HCENL90 

HCENM36 
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H0052: 3, L0439: 1 
andL0779: 1. 

H0052: 2 

H0442: 1,H0052: 1, 
L0754:landL0756:l. 

H0052: 2, L0742: 1 
andL0439: 1. 

H0052:2,L0756:2 
andL0750: 1. 

H0052: 2,L0363: 1 
andL0748: 1. 

H0052: 2 

SOOlO: 1 andH0052: 1. 

H0052: 2 

L0005: 1,S0007: 1, 
H0052: 1, TOOlO: 1, 
L0770: 1,L0769:1, 
L0794: 1,L0515: 1 and 
L0790: 1. 

H0052: 2 


Lys-44 to Lys-49. 


Asp-1 to Lys-9. 


Arg-l to Lys-6. 



Asn-1 to Arg-l 1. 


Ala-lltoGly-20, 
Glu-27 to Tip-32, 
Ala-45 to Phe-50, 
Thr-60 to Pro-75. 
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775363 
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511300 
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953849 
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530909 


HCENM85 

HCENP05 

HCENP71 

HCENP80 

HCENY05 

HCEOC21 

HCEOI74 

HCEON10 

HCEOR31 

HCEOU37 
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H0052: 2 

AR089: 10,AR061: 9 
L0747: 2, S0222: 1, 
H0052: 1, S6028: 1, 
L0769: 1,L0768:1, 
L0777:landL0366:l. 


H0052: 2 

S0222: 1, H0052: 1 and 
L0594: 1. 

H0052: 2 

H0052: 5 

L0438: 3, H0052: 2, 
L0439: 2, L0749: 2, 
L0415:1,S6026: 1, 
L0782: 1,L0809: 1, 
L0352:landL0777: 1. 

H0052: 2 

H0052: 1 andHOOSl: 
1. 

Pro-39 to Asn-44. 

Ala-1 to Ser-8, 
Ser-41 to Cys-54, 
Asn-96 to Asp-103. 

Ala-1 to Ser-8, 
Ser-41 to Cys-54, 
Asn-96 to Asp-103. 

Ala-1 to Gln-7, 
Lys-24 to Ser-30, 
Pro-44 to Ser-57, 
Ser-90 toCys-103, 
Asn-145 to Asp-152. 


Ile-2 to Leu-17. 


Val-29 to Thr-38, 
Pro-48 to Ser-53. 

Pro-40 to Arg-52. 

Asp- 12 to Ala-23. 

Pro-4 to Trp-14, 
Ser-27 to Thr-37. 

H0194: 2 and H0052: 
1. 

H0261: 1, H0052: 1 
and H0194: 1. 

H0194: 1 and S0038: 1. 

H0052: 1 and H0194: 
1. 

Ser-33toPhe-41, 
Ser-48 to GIu-53. 



lle-1 to Ser-9, 
GIy-15to GIy-22, 
Trp-33 to GIv-39. 

Gln-1 toTrp-.10. 

Ile-1 to Trp-15, 
Pro-49 to Tyr-56, 
Phe-59 to Val-65. 
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6p22.3-p22.1 





L0748: 2, H0052: 1, 
HOI 94: 1,H0051: land 
L0743:l. 

H0194:2and L0748: 
1. 

H0052: 1 and H0194: 
1. 

H0194: 2 

HO 194: 2 

H0052: 1 andH0194: 
1. 

AR061: 8, AR089: 5 
H0052: 3, S0282: 1, 
H0194: 1, H0009: 1, 
L0789: 1,L0602: 1 and 
L0439: 1. 

H0194: 2 

H0194: 2 

S0222: 1,H0438: 1, 
SOCIO: 1,H0194: 1, 


Pro-25 to Ser-32, 
Trp-44 to His-52, 
Arg-63 to Cys-69. 


Thr-23 to Tip-28, 
ne-32 to Gly-40. 




Asp-15toThr-21, 
Gln-83toIle-91. 



Leu-3 to Glu-13, 
Gly-18toThr-25. 
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H0562: l.TOOlO: 1, 
S6028: U L0769: 1 and 
L0439: 1. 

H0052: 1 andH0194: 
1. 

H0194: 2 

H0052:landH0194: 
1. 

H0052: 1,H0194: 1 
andL0456:l. 

H0194:2 

H0194:2 

H0052: 2 and H0194: 
1. 

H0052: 1 andH0194: 
1. 

H0052: 1 andH0194: 

HOI 94: 3 

S0007: 2,H0052: 1 and 
HOI 94: 1. 

H0194: 1 andH0201: 
1. 

H0194:2andH0052: 
1. 

H0194: 2 

HOI 94: 1 andH0201: 



Pro-38 to Cys-45. 

Asn-7 to Glu-20, 
His-77 to Lys-82. 

Phe-5 to His-22. 



Pro-1 to Trp-9. 



Glu-1 to Pro-6, 
VaHOtoArg-15. 




Gln-1 to Gln-8. 

Pro-1 to Ser-6. | 

16p13.3 



*— ( 

H0052: 1 and HOI 94: 
1. 

H0194: 2 

L0758:3 andH0194: 
2. 

H0052:6,L0741:4, 
H0261: 1 andL0750: 1. 

H0261:2,H0052: 1 
and L0749: 1. 

H0261: 1 andH0052: 
1. 

H0261:2 

H0261: 1,H0052: 1 


Pro-21toGln-31, 
Pro-39 to Ser-44. 


Leu-4to Asn-10, 

Met-66 to Arg-72. 

Gly-1 to Cys-10. 

Pro-14toGly-19, 
Lys-65 to Lys-70. 

Arg-2 to Php-9, 

Asn-12to Asn-21. 
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land S6028: 1. 

H0261: 1,S0051: land 
L0438: 1. 

H0261:2 

H0261:2 

H0261:2 

H0261: landH0052: 
1. 1 

H0261:2 

H0261: 1 andS0386: L 

H0261:2 

H0261:2 

H0261:2 

H0261:2 

H0261:2 

H0261:2 

L0742: 18,L0776:5, 
L0777: 5, L0806: 4, 
L0743:4,L0731:4, 
L0770: 3, L0803: 2, 
L0439: 2, L0752: 2, 
L0366: 2,H0261: 1, 
SOOlO: 1,L0769: 1, 
L0651: 1,L0805: 1, 


Gly-30 to AIa-49, 
Ser-61 to Pro-70. 

Pro-9 to Lys-14, 
His-19 to Ser-33, 
Ser-35 to Thr-43. 

Leu-40 to Cys-47. 


Gly-29 to Pro-34. 





Ser-4 to Ser-11, 
Glu-20 to Thr-26. 

Tyr-2 to Glu-23, 
Ser-29 to Ile-34. 



Trp-23 to Leu-33, 
Gly-51 toHis-56, 
Pro-58 to Met-75, 
Leu-85toPro-93. 
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721753 


HCEVH53 

HCEVI50 

HCEVI53 
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H0327: 2 

H0327: 2 

H0327: landH0051: 

H0327: 2 

L0756: 3, S0300: 1, 
H0327: 1, S0388: 1, 
L0598: 1 and S0412: 1. 

H0327: 1 andSOOSl: 1. 

H0327: 2 

S0222: 1,S0049: 1, 
H0327: 1,H0051: 1 and 
L0439: 1. 

H0327: 2 

H0052: 1,H0563: 1 
andL0439: 1. 

L0439: 4, L0770: 3, 
L0756: 2, H0052: 1, 
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Ser-6 to Gin- 19. 


Pro-8 to Gln-16, 
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Pro-45 to Gly-53. 
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Cys-36 toGly-41. 
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S0007: 6. L0439: 2, 
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Thr48to Ser-27. 

Arg-1 to Gly-8, 
Arg-17 to Lys-22. 
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S6024: 3 

S6024: 1 andH0229: 1. 

S6024: 1 andS0036: 1. 

S6024: 2 and S0049: 1. 

S6024: 2 

S6024: 2 

S6024: 1 and SOOOl: 1. 

S6024: 2 

S6024: 1 andH0051: 1. 

9: 5, S6024: 2, 
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S6024: 2 

L0756: 5, S6024: 2 and 
L0604; 2. 
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Arg-36 to Lys-41. 
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Ser-60 to Arg-71. 
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S0260: 1. 
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S0412: 1. 

L0805: 3, L0776: 2, 
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S0300: 1, S0049: 1 and 
L0756: 1. 

S6026: 1 andSOSOO: 1. 

S0300: 2 

1 H0009:3aiidS0300: 1. 

S0300: 2 

S0300: 2 

S0300: 2 

S0007: 4,L0754: 2, 
L0747: 2 and S0300: L 

S6026: 1, S0300: 1 and 
L0779: 1. 

S0300: l,H0462i 1 and 
L0756: 1. 

L0439: 6,L0438:3, 
L0759: 2, S0300: 1, 
H0051: 1.L0794: 1, 
L0805:1,L0352: 1, 
L0742: 1, L0780: 1 and 
L0758: 1. 

SOOOl: 1 andN0006: 1. 

H00O9:2andL0758: 
1. 







Ser-8 toGlu-15. 

Lys-1 to Ser-10, 
Asp-28 to Phe-39. j 

GIu-49 to Trp-54. 

Thr-9 to Asn-24, 
Pro-26 to Pro-40. 


Ala-1 to Asp-6, 
Ala-17 to Lys-24, 
Gln-31 toGln-36. 

Lys-46 to Phe-52. 

Pro-33 to Arg-39, 
Ser-49 to Tyr-58, 
Val-61 to Met-70. , 
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H0009:2andL0522: 
1. 

H0009: 2 

H0009:2 

H0009: 2 

H0009: 1,H0564: 1, 
L0766: 1, L0663: 1 and 
L0759: 1. 

H0009: 2 

H0052: 1 and H0009: 
1. 

H0009: 2 

L0805: 4, L0439: 3, 
S6028: 2, L0776: 2, 
L0438:2, H0009: 1, 
H0172: 1,S0036: 1, 
L0741: landL0756: 1. 

L0539: 1, H0009: 1 
andS0036: 1. 

S0007: 1 andH0009: 1. 

H0009: 2 

AR061: 4,AR089: 3 
L0756: 4, L0439: 2, 
S0412: 2, S0222: 1, 
H0327: 1,H0009: 1, 
L0157: 1 andS0031: 1. 


Ser-28 to Asn-36. 


|IIe-28 to Gly-35. 

Arg-1 to Asn-30. 

Ser-44 to Cys-55. 

Gly-1 to Phe-13, 
Ser-33toPro-5I, 
Gly-70 to Thr-75. 


Ser-20 to Pro-25. 

Ser-1 to Leu-13, 
Pro-15 to Glu-26. 


Asn-49 to Ser-57. 
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H0009: 2 

H0009: 2 

H0009: 2 

H0009: landTOOlO: 
1. 

H0009: 6 

H0009: 8 

H0009: 2, L0439: 2, 
SOOOl: 1 andS0222: 1. 

H0009: 2 

H0052: 1 and H0009: 
1. 

H0009: 2 and L0750: 
1. 

H0009: 3 

H0009:5 

H0009: 2 

H0009: 2 

H0009:2and L0605: 
1. 

H0009:2 

Arg-7 to Trp-12. 

1 Phe-1 to Leu-6. 

Asp-1 to Thr-6. 

Ser-18to Ala-33, 
Lys-43 to Arg-48. 

Ala-1 to Cys-13, 
Lys-22 to ffis-29, 
Leu-61 to Ser-66, 
Thr-69 to Phe-74. 

Arg-15toLeu-24, 
Cys-29toCys-37. 
Ala-45 to Arg-50. 


Arg-18toLys-31. 

Pro-14toPro-21, 
Glu-49 to Arg-58. 


Pro-3 to Gly-14. 



Leu-36 to Thr-46. | 


Gly-27 toHis-34. 

Thr-l to Asp-6. 
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180297, 
230450, 
263200, 
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H0009: 6 
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H0009: 6 

H0009: 3 



Arg-1 to Gln-7.

Pro-7 to Ile-13.

Gln4 to Gln-16,
Gln-61 to Asp-72.

Thr-48 to Tyr-53.

Asn-1 to Lys-11.

Ser-23 to Gly-33, 
Ala-41 to Arg-47, 
Arg-51 to Arg-56. 

Pro-15 to Gly-25,
Pro-39 to Glu-49.

Thr-1 to Lys-20,
Cys-32 to Lys-42. 

Gly-1 to Asp-7, 
Pro-36 to Gly-46. 

Asn-1 to Trp-7. 



Gln-1 to Leu-11,
Lys-36 to Gly-46. 

Gly-46 to Trp-55, 
Pro-58 to Ile-66. 
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H0009: 2 
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H0009: 1 andTOOlO: 
1. 

H0009: 2

H0009: 3 

H0009: 2 

H0009: 2 

H0009: 4

S0222: 1, H0009: 1 and
L0439: 1.



Thr-2 to Ala-24. 

His-1 to Arg-14, 
Ser-20 to Arg-27, 
Ala-52 to Arg-71,
Pro-76 to Gly-84. 






Ser-16 to Gly-22.
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H0009:2,L0748:2 
and L0439: 1. 

H0052: 2, H0009: 2, 
S0001: 1, L0438: 1 and
L0439: 1.

H0009: 2 and L0753: 1.

H0009: 2

H0009: 2, H0052: 1,
T0010: 1 and L0774: 1.

H0009: 3, S0222: 1 and
T0010: 1.

H0009: 2

H0009: 3, L0769: 1,
L0803: 1, L0438: 1,
L0439: 1 and L0777: 1.

H0009: 2 

H0009: 4 

L0439: 3, T0010: 2,
S0007: 1, H0374: 1,
H0009: 1 and S0388: 1.

H0009: 2 

H0009: 2 



Pro-15 to Gly-26, 
Leu-35 to Cys-42, 
Gly-59 to Gln-67. 


Ser-1 to Cys-7,
Gln-9 to Gln-19.

Glu-23 to Ile-32. 

Ser-5 to Ala-12, 
Gln-15 to His-21, 
His-79 to Cys-86. 


Ser-1 to Cys-7.

Ser-41 to Leu-47.

Asn-35 to Thr-40.

Ile-1 to Asn-8,
Leu-36 to Asp-64.

Arg-22 to Pro-28,
Gly-31 to Arg-45.

Gln-7 to His-18. 
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H0009: 2 

S0010: 1, H0009: 1,
L0803: 1, L0651: 1,
L0756: 1 and L0777: 1.

H0009: 2 and L0700: 1.

H0009: 2 

H0009: 3 

H0052: 1 and H0009: 
1. 

L0794: 2, S0001: 1,
H0009: 1, L0744: 1 and
L0777: 1.

S0007: 1 and H0172: 1.

H0172: 2

H0172: 2 

H0172: 2 

H0172: 2 

H0172: 2

H0172: 2

H0172: 2 

H0172: 2 

H0172: 2 

H0172: 2 

H0172: 2 

Gly-21 to Gln-28. 

Arg-42 to Arg-48. 

Glu-27 to Asp-32, 
Gln-37 to Lys-44, 
Ser-53 to His-61.



Ser-12 to Lys-18,
Pro-56 to Glu-66.

Glu-12 to Val-19,
Ser-50 to Cys-55.

Arg-14 to Pro-21.

Lys-1 to Pro-10,
Leu-30 to Lys-38.


Val-8 to Ser-15.

Cys-7 to Arg-13.

Ser-28 to Thr-36, 
Trp-68 to Arg-73. 





Trp-11 to Trp-19.
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S0222: 2 

S0222: 2 

L0756: 4, S0412: 2, 
S0222: 1, H0327: 1,
H0009: 1, L0157: 1 and
S0031: 1.

L0803: 2, L0588: 2,
S0222: 1, S0051: 1,
T0010: 1, L0769: 1 and
L0766: 1.

2: 1,S6028: 1, 
1,L0796: 1, 
1,L0741: 1, 
1 andL0747: 1. 

S0222: 2, S0010: 1,
S0346: 1, L0438: 1 and 
L0439: 1. 

S0222: 2 


: 2. S6024: 
: 1,L0764: 
: 1, L0805 
: 1 andLO 

: 28, ARC 
?: 5, S0222 
4, H0052: 

2, H0333: 
1,H0194: 


Ser-10 to Cys-16, 
Pro-18 to Ser-28, 
Leu-38 to Asp-44. 


Thr-16 to Glu-27,
Pro-54 to His-64.

Ser-16 to Tyr-24.

Thr-19 to Phe-24,
Leu-28 to His-35.

Glu-1 to Ser-8.

Gly-19 to Trp-26,
Leu-42 to Trp-52, 
Pro-61 to Thr-75. 


Ser-58 to Gly-71, 
Pro-76 to His-83, 
Pro-89 to Trp-107.


4465 

4466 

4467 

4468 

4469 

4470 

4471 

4472 


2-127 

12-242 

312-130 

223-462 

159-410 

2-229 

2-385 

279 - 635 
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1142 

1143 

1144 

1145 

1146 
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724330 

959353 

950223 

708484 

666390 

716579 

919998 

925485 


HFPBY50 

HFPCA08 

HFPCB06 

HFPCK53 

HFPCL18 

HFPCL44 

HFPCS02 

HFPCS09 
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L0439: 6, L0438: 2, 
S0222: 1, S0010: 1,
H0194: 1, L0455: 1 and
S0036: 1.

S0222: 3, L0371: 1 and
L0749: 1. 

L0756: 5, S0222: 1 and 
S0038: 1. 

S0222: 2 

S0001: 1 and S0222: 1.

S0222: 3, L0751: 3, 
L0157: 1, L0803: 1 and 
L0608: 1. 

S0222: 2 

S0222: 2 

S0222: 2 

S0222: 1 and S6028: 1. 

S0222: 2 

S0222: 2 

S0222: 2 

Lys-1 to Leu-20, 
Thr-38 to Arg-44,
Arg-51 to Gln-60,
Gln-74 to Lys-82.

Arg-30 to Gly-36,
Arg-38 to Ser-51.

Ile-10 to Gln-18,
Asn-39 to Glu-50. 


Pro-7 to Gly-15, 
Ser-32 to Leu-38, 
Lys-47 to Pro-52, 
Pro-60 to Gln-66. 

Asn-48 to Arg-53. 



Gln-22 to Arg-31,
Ala-61 to Pro-85.

Pro-25 to Cys-51,
Gly-59 to Ser-69,
Ala-74 to Ala-80.

Thr-12 to Val-17. 

Met-9 to Cys-18, 
Ser-35 to Asp-42. 

Ala-13 to Gly-22, 
Ala-25 to Lys-35, 
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4500 
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1-288 
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1177 
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968941 
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HFPEVOl 
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HFPFD78 

HFPFJ54 

HFPFO70 

HFPFQ63 

HFPFT30 

HFPFZ35 

HFPGD04 

HFPGD43 

HFPGE35 
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L0776: 1. 

S0007: 1, S0222: 1,
H0178: 1, S0388: 1 and
L0779: 1.

S0222: 2

S0035: 1 and S0050: 1.

S0050: 2 

S0282: 1, S0010: 1,
S0050: 1, L0794: 1,
L0741: 1, L0747: 1 and
S0106: 1.

S6016: 1 and S0050: 1.

S0050: 2 

S0050: 2 

S0050: 2 

S0050: 2 

S0050: 2 

S0050: 2 

S0050: 2 

S0050: 1 and S0031: 1.

S0050: 2 

S0050: 2 

S0050: 2, S0300: 1 and 
L0439: 1. 

S0050: 2 

S0050: 2 


Thr-35 to Pro-43, 
Pro-60 to Ser-68, 
Ser-72 to Arg-77. 


Ser-16 to Thr-21. 


Arg-5 to Asp-11, 
Gly-20 to Pro-29, 
Glu-37 to Leu-45, 
Glu-52 to Gly-66, 
Ser-88 to Glu-93, 
Leu-105 to Asp-123.



Ile-7 to Gly-15. 




Glu-20 to Tyr-25. 

Gln-36 to Ser-47.


Leu-1 to Lys-21.



Arg-11 to Arg-18.
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921503 

961915 

728462 

676364 

927216 

974470 

757471 

527844 

954089 

527663 

839092 

527839 

827374 

572406 

921595 

527661 

735061 

968382 

521887 


HFPKE09 

HFPKE10

HFRAB74 

HFRAF23 

HFRAF66 

HFRAJ66 

HFRAO93

HFRAP76 

HFRAQ07 

HFRAQ70 

HFRAU42 

HFRAU67 

HFRAU78 

HFRAV44 

HFRBD01

HFRBD18 

HFRBD55 

HFRBD73 

HFRBE38
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S0050: 2 

S0050: 2 

S0050: 2

L0439: 2, S0222: 1,
S0050: 1 and L0438: 1.

S0050: 3

S0050: 2 and S0051: 1.

S0050: 2 

S0050: 2 

S0050: 2 

AR089: 2, AR061: 1
S0050: 1 and S0260: 1.

S0050: 2

S0050: 2

S0001: 1 and S0050: 1.

S0050: 2 

S0050: 2 

S0050: 2 

S0222: 1, S0050: 1 and
S0051: 1.

H0123: 3 

H0123: 2 and L0759: 
1. 

H0123: 2 


Lys-1 to His-21.

Arg-7 to Asn-22, 
Leu-28 to Gly-35, 
Gly-55 to Arg-60.


Ser-3 to Gly-27. 


Asn-35 to Pro-44, 
Thr-56 to Gly-63. 


Asn-11 to Leu-17,
Pro-19 to Thr-25.

Gln-9 to Ser-15.



Leu-21 to Ile-26.


Ala-1 to Gln-8.


His-8 to Ala-17, 
His-26 to Ala-32. 


Leu-13 to Glu-26.
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H0123: 2, L0783: 1 
and L0665: 1. 

H0123: 2, L0747: 1 
and L0756: 1.

H0123: 2

S0222: 1 and H0123: 1.

H0123: 2

S0222: 1 and H0123: 1.

S0010: 1, H0123: 1 and
S0051: 1.

H0123: 2 and L0779: 1.

AR089: 3, AR061: 2
H0563: 1 and H0123: 1.

H0123: 2

S0001: 3

S0001: 2

S0001: 2

S0001: 2

S0001: 1 and S0282: 1.

S0001: 2

S0001: 2

S0001: 1 and H0194: 1.

Ala-7 to Arg-12. 

Pro-5 to Pro-12. 



Gly-18 to Asn-23. 

Lys-9 to Ile-21. 

Gly-1 to Gly-7,
Pro-9 to Pro-14,
Ser-16 to Arg-21.



Glu-1 to Tyr-8. 



Glu-15 to Ser-21. 

Leu-22 to Gly-27, 
Gln-41 to Trp-46. 

Asp-1 to Asn-6, 
Gln-9 to Tyr-21, 
Trp-30 to Ala-36. 

Lys-20 to Thr-28.


Arg-1 to Leu-6, 
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4571 

4572 

4573 
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967712 

525623 

525620 

720612 

524358 

917997 

529952 

578937 

657020 

578933 

709050 

667694 

708440 

530313 

738429 

970695 

678574 

974236 

HFTAB60 

HFTAB62 

HFTAE25 

HFTBI47

HFTBJ74 

HFTBT57 

HFTCX91 

HFTDC19 

HFTDF15 

HFTOPSl 

HFXAA32 

HFXAF16 

HFXAF37 
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HFXAK58 

HFXAK62 

HFXAM22 

HFXAM38


115 


WO 01/59063


PCT/US01/01334


























7q11.23

















S0001: 2

S0001: 2

S0001: 2

S0001: 1, S0222: 1,
L0748: 1 and L0749: 1.

S0001: 2

S0001: 2

S0001: 1, H0438: 1 and
H0009: 1.

S0001: 2

S0001: 2

S0001: 2

S0001: 2

S0001: 2

S0001: 1 and S0049: 1.

S0001: 2

S0001: 2

S0001: 1 and H0392: 1.

S0001: 2

S0001: 2

S0001: 1 and S0260: 1.

Arg-10 to Gly-19,




Arg-12 to Ser-18,
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SOOOl: 2 

S0001: 1 and S0031: 1.
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SOOOl: 2 
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H0052: 1 andH0194: 1. 
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Thr-2 to Ser-13, 
Lys-18 to Tyr-32.

Arg-1 to Arg-6,
Pro-9 to Gly-15.

Lys-3 to Glu-8, 
Pro-22 to Gln-27. 


Ser-29 to Gln-40, 
Pro-49 to Ser-54. 


Leu-2 to Lys-7, 
Phe-20 to Gly-25, 
Ser-54 to Ile-59. 

Ser-1 to Ala-17, 
Ala-29 to Ala-35,
Asp-42 to Pro-47. 

Thr-26 to Gly-34, 
Pro-37 to Gly-50, 
Lys-59 to Ala-67, 
Lys-70 to Gln-77. 

Arg-6 to Lys-11, 


Val-7 to His-13,
Ser-19 to Gly-24. 


Gly-35 to Gly-41. 
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Ser-51 to Gly-62. 
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Arg-1 to Lys-15, 
Gln-36 to Gly-44, 
Gly-57 to Gly-63. 

Ser-12 to Arg-18, 
Thr-35 to Pro-42, 

Ala-15 to Cys-22,
Gly-26 to Glu-37.

Asn-1 to Arg-16,
Ser-23 to Tyr-34.

Lys-38 to Pro-43. 

Ser-49 to Gly-60. 

Arg-46 to Pro-52. 

Lys-22 to Gln-28, 
Trp-35 to Tyr-54. 

Asn-7 to Leu-22. 
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S0282: 1,H0438: 1, 

Pro-74 to Pro-97. 

Pro-42 to Ser-48. 

Pro-36 to Cys-43. 
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Gln-69 to Arg-87, 
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SOOlO: 1, S0048: 1. 
S0388: 1, S0036: 1, 
L0369: 1, L0638: 1, 
L0768: 1, L0775: 1,
L0740: 1, L0747: 1 and 
S0412: 1. 

S0282: 2

S0282: 1, H0392: 1,
S0388: 1, L0532: 1 and 
L0748: 1. 

S0282: 2 

S0282: 2 

S0282: 2 

S0282: 1, L0747: 1 and
S0031: 1.

S0282: 2

S0282: 1 and S0260: 1.

L0756: 2, S0282: 1, 
S0007: 1, S0300: 1, 
H0455: 1, S0036: 1 and 
L0439: 1. 

S0222: 2, S6024: 1, 
S0282: 1, L0769: 1 and
L0752: 1. 

S0260: 2 and S0282: 1. 


Asn-1 to His-17.

Glu-24 to Gln-30.

Gln-8 to Arg-17,
Gln-92 to Arg-98.


Pro-12 to Ser-29.



Ala-25 to Cys-30, 
Ser-47 to His-52. 


Glu-41 to Ser-46. 

Ala-5 to Cys-13, 
Arg-40 to Val-46. 
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919713 
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778958 

667604 
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HFXJY18 

HFXKA64 

HFXKB29 
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HFXKK28 

HFXKL93 

HFXKU78 
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S0282: 1,S0050: 1 and 
S0260: 1. 

S0282: 1 and H0340: 1.

S0282: 2

S0001: 1 and S0282: 1.

S0282: 1 and S0386: 1.

S0001: 3 and S0282: 1.

S0001: 1 and S0282: 1.

S0282: 2

S0282: 2

S0282: 2

S0001: 1 and S0282: 1.

S0282: 1, S0222: 1, 
L0749: 1, L0779: 1 and 
L0599: 1. 

S0282: 2 

S0282: 2 

S0282: 1, S0010: 1,
L0439: 1 and L0592: 1.

H0020: 2 and L0400: 
1. 

L0743: 2, L0748: 2, 
H0052: 1, H0194: 1,
H0051: 1, L0764: 1, 

Arg-2 to Ala-7, 
Arg-22 to Ala-34, 
Pro-37 to Asp-46. 


Arg-1 to Ile-7. 

Cys-1 to Gly-8. 

Gly-14 to Thr-24.






Cys-32 to Gly-38. 

Lys-17 to Ala-43, 
Pro-46 to Lys-64, 
Ser-66 to Trp-83. 

Thr-1 to Asp-7.

His-1 to Ala-6,
Pro-11 to Asn-16,
Glu-26 to Arg-31, 
Ser-37 to Ala-42. 


Tyr-1 to Ser-7, 
Asn-40 to Leu-46. 


4765 

4766 

4767 

4768 

4769 

4770 

4771 

4772 

4773 

4774 

4775 

4776 

4777 

4778 

4779 

4780 

4781 

367-603 

3-293 


372 - 530 

249 - 380 

58 - 282 


60 - 275 


154-402 

271 -504 

98 - 466 

94-318 

330- 187 

190 - 68 

1441 

1442 

1443 

1444 

1445 

1446 

1447 

1448 

1449 

1450 

1451 

1452 

1453 

1454 

1455 

1456 

1457 

966844 

854357 

717246 

734913 

792004 

775207 

952388 

871958 

747444 

793661 

793183 

713228 

741199 
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507114 

557805 

HFXKZ11

HFXLD19 

HFXLF45 

HFXLI27 

HFXU16 

HFXLK81 

HFXLL07 

HFXLL44 

HFXLM65 

HFXLM94 

HFXLN18 

HFXLN42 

HFXLP61 

HFXLQ90 

HFXLR93 

HHPAC34 

HHPBA50 
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1 H0051:2 

H0051: 1 and S0051: 1.

H0051: 1 and S0036: 1.

H0051: 2

H0051: 2

H0052: 1, H0051: 1
and L0593: 1.

H0051: 2

H0051: 2

H0051: 2

H0051: 2

H0567: 1 and H0051: 1.

H0051: 2

L0745: 2, S0222: 1,
H0051: 1 and L0731: 1.

S0412: 6, S0414: 4,
L0439: 2, S0300: 1,
S0010: 1, H0051: 1,
L0769: 1, L0666: 1 and
L0756: 1.

H0051: 1 and S0112: 1.

L0741: 3, H0051: 1,
H0201: 1 and L0756: 1.


Glu-10 to Gly-18.


Glu-5 to Pro-11.

Gly-5 to Cys-12.

Gln-4 to Phe-17.

Lys-1 to Lys-7. 





Ile-7 to Val-21.


Lys-2 to Asp-9, 
Asp-46 to Phe-52. 


Lys-7 to Arg-12, 
Gly-25 to Trp-31. 

Arg-8 to Arg-14. 
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530802 
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530798 
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530795 

791209 

933967 

530482 

857667 


HHPFB69 

HHPFC37 

HHPFD33 

HHPFD65 

HHPFF68 

HHPFG13 

HHPFN25 

HHPF033 

HHPFQ22 

HHPFV56 

HHPFV84 

HHPFW49 

HHPGB56 

HHPGP67 

HHPGV88 

HHPGZ06 
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H0201:2 

H0052: 1 and H0201: 1.

H0052: 1 and H0201: 1.

H0052: 1 and H0201: 1.

H0194: 1 and H0201: 1.

H0194: 1 and H0201:

H0052: 1 and H0201: 1.

H0261: 1 and H0201:

H0201: 2

H0052: 1 and H0201: 1.

H0052: 1 and H0201: 1.

H0201: 3

H0201: 2
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-H 

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CN 
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z> o 
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O CO 

AR061: 2, AR089: 1
S0282: 1 and S0051: 1.

S0051: 2 and L0731: 1.




Asn-54 to Arg-59. 


Asn-12 to Asn-23. 




Glu-1 to Gly-10,
Gln-17 to Cys-27,
Ser-45 to Ser-50. 

Asn-34 to Asn-41. 
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S0051:2 

L0794: 12, H0455: 2, 
S0051: 2, L0776: 2,
L0789: 2, L0439: 2,
S0001: 1, S0222: 1,
L0157: 1, L0370: 1,
L0768: 1, L0805: 1,
L0755: 1 and L0759: 1.

S0051: 2

S0051: 2, L0776: 2 and
H0052: 1.

L0742: 13, L0439: 6,
L0438: 3, S0010: 2,
S6028: 2, L0756: 2,
S0346: 1, S0051: 1,
T0010: 1 and L0789: 1.

S0010: 1, S0051: 1 and
S0412: 1.

S0051: 2

L0748: 5, L0747: 2,
L0749: 2, L0600: 2,
H0392: 1 and S0051: 1.

S0051: 2

S0051: 2

S0051: 2

S0051: 2 and L0747: 1.

Phe-8 to His-13.



Ile-11 to Trp-16,
Ser-54 to Ser-59.

Leu-17 to Gly-34,
Cys-78 to Cys-84.



Asp-40 to Arg-46.


Lys-5 to Pro-10.

Thr-11 to Gly-17,
Lys-49 to Leu-54, 
Pro-57 to Gly-62. 
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H0052: 1,80051: 1 and 
L0792: 1. 

S0388: 2, S0051: 1,
L0803: 1, L0647: 1 and
L0756: 1.

S0050: 1 and S0388: 1.

S0388: 2

S0010: 1, S0388: 1 and
L0559: 1.

S0388: 2

S0388: 2

H0052: 1 and S0388: 1.

H0052: 1 and S0388: 1.

S0388: 1 and S0051: 1.

S0388: 2

S0414: 2, S0222: 1,
S0388: 1 and S0031: 1.

S0388: 1 and S0260: 1.

S6014: 2

L0756: 4, L0777: 3,
L0779: 2, S6014: 1,
H0052: 1, T0010: 1,

Thr-18 to Glu-27. 


Phe-11 to Lys-16. 


Phe-5 to Ser-11,
Val-17 to Lys-24,
Glu-27 to Asp-34, 
Ser-36 to Cys-42, 
Lys-52 to Lys-60. 

Pro-7 to Gln-13.



Thr-1 to Ser-10,
Ser-12 to Gly-19.



Lys43 to Lys-21. 


Thr-11 to Glu-16,
Gly-33 to Trp-40, 
Leu-56 to Trp-66. 

Glu-39 to Trp-47, 
Lys-59 to Asn-65, 
Lys-86 to Arg-100.
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12ql3 

L0770: 1, L0794: 1, 
L0809: 1 and L0758: 1.

S6014: 2

S6014: 1 and S0051: 1.

S0220: 2

AR061: 1, AR089: 1
S0220: 1 and S0050: 1.

AR089: 1, AR061: 0
S0220: 1 and S0051: 1.

L0439: 3, S0222: 1 and
T0010: 1.


Pro-33 to Thr-43. 




Arg-1 to Thr-10.

Lys-100 to Phe-110.
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10q23.3-q24 



H0392: 1 and T0010: 1.

T0010: 2

T0010: 3

S0222: 1, S0010: 1,
S0346: 1 and T0010: 1.

L0742: 4, T0010: 2 and
L0748: 1.

Arg-10 to Gly-27,
Arg-47 to Cys-53,
His-88 to Gly-99.


Ser-14 to Trp-26.


Arg-22 to Lys-27,
Pro-53 to Arg-61,
Pro-69 to Met-83.

Ser-10 to Gln-24.
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S6024: 1,80001: 1, 
L0770: 1, L0768: 1, 
L0794: 1, L0776: 1,
L0352: 1, L0745: 1,
L0777: 1, L0753: 1 and
L0758: 1.

T0010: 2

H0194: 1 and T0010: 1.

S0030: 1 and T0010: 1.

L0439: 8, L0351: 3,
T0010: 2, L0438: 2,
L0747: 2, H0327: 1,
L0769: 1, L0800: 1,
L0352: 1, L0740: 1,
L0751: 1, L0745: 1,
L0750: 1, L0756: 1 and
L0777: 1.

H0052: 1 and T0010: 1.

AR051: 10, AR050: 2,
AR089: 1, AR054: 0,
AR061: 0
L0439: 2, S0222: 1,
T0010: 1 and L0794: 1.

L0756: 4, L0777: 3,
L0779: 2, S6014: 1,


Asn-14 to Lys-20. 

His-11 to Gln-22,
Leu-41 to Glu-49,
Pro-51 to Gln-60.




His-1 to Gly-8.

Pro-10 to Ser-22.

Thr-2 to Arg-10.


4893 

4894 

4895 

4896 

4897 

4898 

6572

4899 


243 - 58 


272 - 78 

414- 196 

237 - 509 

434 - 297 

193-65 

7-210 


1569 

1570 

1571 

1572 

1573 

1574 

3248 

1575 


753950 

504185 

925514 

504180 

536995 

890949 

890950 

504155 


HIBCW65 

HIBDB88 

HIBDE04

HIBDE84 

HIBEB18 

HIBEF51 

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H0052: 1,T0010: 1, 
L0770: 1, L0794: 1, 
L0809: 1, L0439: 1 and
L0758: 1.

L0439: 5 and T0010: 2.

AR089: 0, AR061: 0
T0010: 2

L0439: 3, T0010: 1 and
S6028: 1.

S0001: 2, L0163: 2,
T0010: 2, H0455: 1 and
L0741: 1.

H0384: 2

H0384: 2

S0414: 12, L0439: 2,
H0441: 1, L0157: 1 and
L0742: 1.

S0324: 2 

H0346: 2 

H0346: 2 

H0346: 2 

H0346: 2 


Gly-3 to Tyr-10, 
Pro-12 to Thr-19, 
Ser-32 to Ser-37. 

Phe-1 to Gln-6,
Lys-44 to Asn-51.


Gln-2 to Gly-11,
Ser-19 to Gly-25.


Gly-1 to Ser-10. 

Pro-15 to His-21. 

Gln-3 to Phe-8. 

Glu-1 to Lys-6. 

His-20 to Leu-32, 
Gln-41 to Arg-48.




Lys-29 to Pro-34.


4900 

6573 

4901 

4902

6574 

4903 

4904 

4905 

4906 

4907 

4908 

4909 

4910 

4911 


295-468 


3-416 

304 - 543 

563 - 402 

55 - 432 
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1587 1 


537044 

537054 

731480 

503401 

503402 

854491 

840126 

766982 

669899 

681213 

523328 

856210 

522486 

529416


HIBEJ82 


HIBEN23 

HIBEO65

HKB1E62 

HKB1F66 

HiaYC19 


HMDAA14 

HMDAA15 
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S6024: 1 andH0346: 1. 

H0346: 1 and S0300: 1.

H0346: 2

H0346: 2

S0001: 2 and H0346: 1.

H0346: 2 

H0346: 2 

H0346: 2

L0748: 2, H0346: 1,
H0123: 1 and L0639: 1.

H0346: 2 

L0770: 7, L0439: 4, 
L0805: 3, L0776: 2, 
L0438: 2, L0756: 2,
L0415: 1, S0010: 1,
H0390: 1, S6028: 1 and 
L0794: 1. 

S6028: 3 

AR051: 10, AR050: 9,
AR054: 9
L0438: 2, L0439: 2,
S0300: 1, S0222: 1,
S0051: 1, S6028: 1,
S0036: 1 and L0769: 1.

S6028: 2 

Gly-17 to Gln-39.



Gly-1 to Gln-10. 

Lys-25 to Phe-30, 
Ala-87 to Asn-92. 


Pro-1 to Glu-13, 
Lys-59 to Gly-66. 

Val-2 to Trp-18. 
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301201, 
301590, 
309550 
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Xq27.3 


L0731: 1. 

H0052: 1, S6028: 1, 
L0748: 1 and L0731: 1.

L0438: 2, L0439: 2, 
S6028: 1, S0036: 1 and 
L0592: 1. 

S0220: 1 and S6028: 1.

S6028: 2 and S0051: 1.

S0010: 1 and S6028: 1.

S6028: 2

H0261: 1, S6028: 1 and
L0731: 1.

L0439: 3, H0052: 2,
S6028: 1, L0438: 1 and
S0260: 1.

S6028: 2 and L0439: 1.

S0282: 1 and S6028: 1.



Thr-14 to His-20. 

Thr-14 to His-20. 



Lys-1 to Asp-10.


Pro-1 to Gln-15,
Pro-17 to Ile-23.

Pro-10 to Glu-16,
Arg-52 to Gly-57. 

Pro-8 to Ser-13, 
Gln-20 to Asn-27, 
His-32 to Gly-37, 
Gly-45 to Gly-52. 

Lys-16 to Glu-23, 
Gln-28 to Phe-33. 

Ala-35 to Gly-40. 


4943 

4944 

6576 

6577 
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4946 
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4950 

4951 

4952 
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795121 

526087 

871032 

871033 

781070 

960080 

578291 

739134 

706228 

507373 

666217 

761384 


HMIAY95 

HMIAZ93 

HMIBA83

HMIBD06

HMIBH81



HMICG44

HMICG64

HMICL18

HMICL72
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S0049: 1 andS6028: 1. 

L0439: 4, S0346: 1 and 
S6028: 1. 

S0300: 1 and S6028: 1.

S0222: 1, S6014: 1,
S0010: 1, S6028: 1 and
S0036: 1.

S0386: 2 and S6028: 1.

AR089: 5, AR061: 4
S0388: 1 and S6028: 1.

S0222: 1 and S6028: 1. 

H0052: 1 and S6028: 1. 

H0391: 2

L0518: 2, H0391: 1
and H0172: 1.

H0392: 2

AR061: 2, AR089: 1
S0007: 2, H0392: 2,
L0745: 1, L0753: 1,
L0759: 1 and L0589: 1.

H0392: 1 and S0048: 1.

H0392: 2 



Ala-20 to Ser-26. 


Asp-1 to Val-15.

Ser-5 to Gly-10.


Asp-43 to Thr-49,
Ser-56 to Asn-62.

Gly-34 to Lys-41.

Pro-1 to Leu-6.


Trp-11 to Ala-18,
Pro-48 to Asn-53. 




Pro-16 to Asn-26,
Asn-35 to Thr-41,
Asp-84 to Gly-91,
Gln-108 to Thr-114.
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662750 

733060 

669540 

911560 

927313 

707941 

625257 

781870 

573634 

959439 

856035 

950537 

702611 

573520 

950532 

HMICM17

HMICM88 


HMICO20 

HMICQ32 

HMICR35 

HMICR53 

HMICS71 

HMICT95 

HMJAC89 

HMJAX08 

HMKAD61 

HMKAD85 

HMKAF33 

HMKAF85 

HMKAJ50 
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L0592: 2, L0594: 2, 
S0001: 1, H0392: 1,
L0157: 1 and L0647: 1.

H0392: 2

H0392: 1, H0327: 1
and L0753: 1.

H0392: 2 and L0790: 1.

H0392: 2 

H0392: 2 

H0392: 2 

H0392: 2 

S0300: 1, H0392: 1 and
L0756: 1.

H0392: 2 

H0392: 2 

H0392: 1 and H0052: 1.

H0392: 2

S0001: 1 and H0392: 1.

H0392: 2 
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Pro-l to Lys-6, 
Ser-105 to Trp-110.

Gln-15 to Glu-23.

Gln-38 to Ser-43,
Arg-67 to Asp-72.

Lys-1 to Gly-13,
Val-15 to His-25.

Pro-16 to Ala-23,

Glu-29 to Gln-35. 

Ala-7 to His-14, 
Ser-43 to Tyr-54. 


Glu-1 to Gln-10. 



Lys-14 to Lys-21,
Pro-25 to Lys-33, 
Arg-49 to Val-54, 
Asp-91 to Leu-97. 

Leu-11 to Glu-16.
Asn-1 to Arg-8, 
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188450, 
188450, 
188450 
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H0392: 2 

H0392: 2 

H0392: 2 

H0392: 2

H0392: 2

L0743: 3, L0439: 3,
H0392: 1 and T0010: 1.

H0392: 2 

H0392: 2 

H0392: 2 

S0001: 1, H0392: 1 and
L0366: 1.

H0392: 2

H0392: 2

S0222: 1 and H0392: 1.

H0392: 2

H0392: 1 and H0123: 1.

H0392: 2 

Glu-23 to Ser-33, 
Arg-41 to Lys-53. 

Arg-1 to Ala-9,
Ser-48 to Phe-56.

Arg-14 to Pro-21.

Lys-26 to Phe-34.

Leu-7 to Leu-12, 
Gln-20 to Asn-34. 




Lys-50 to Thr-57. 


Tyr-34 to Ser-39. 

Trp-3 to Tyr-21, 
Arg-23 to Ser-41,
Cys-44 to Trp-80,
Thr-85 to Leu-93.

Asp-1 to Asn-8,
Tyr-11 to Asn-19.

Ser-5 to Lys-13.

Arg-18 to His-23.

Ala-1 to Gln-9, 

Ser-17 to Ser-23, 
Asp-35 to Ser-41. 
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Gly-39 to His-44. 
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SOOOl: 1,S0031: 1 and 
S0260: 1. 

S0106: 2

S0031: 2

H0409: 1 and S0031: 1.

S0031: 2

S0031: 2

S0031: 2
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SOOlO: 1 andS0031: 1. 
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Thr-l to Gly-7, 
Ser-22 to Met-28, 
Ser-53 to Met-59, 
Asn-77 to Leu-82,
Glu-96 to Phe-105,
Phe-111 to Arg-116,
Arg-121 to Asp-126. 
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80031:2 

S0031: 2

AR061: 107, AR089: 62

L0771: 2, L0666: 2,
L0755: 2, S6024: 1,
H0123: 1, L0650: 1,
L0792: 1, L0750: 1,
L0779: 1, L0777: 1 and
S0031: 1.

S0031: 2

S0031: 2

S0386: 1 and S0031: 1.

S0031: 2

S0031: 2

S0031: 2

S0031: 1 and S0260: 1.

S0031: 2

AR061: 5, AR089: 2
S0031: 2

S0031: 2


H0052: 2 and S0031: 1.

S0282: 1 and S0031: 1.

Ser-30 to Ser-35.


Gln-6 to Ser-14.


Lys-12 to Trp-18,
Gly-27 to Ile-41.

Ile-8 to Ser-22. 

Phe-3 to Arg-8. 


Phe-20 to Asn-26. 




Gln-57 to Gly-63. 

Tyr-12 to Asn-20,
Thr-58 to Gly-65. 

Pro-15 to Cys-28, 
Pro-50 to Cys-57. 


Phe-22 to Ser-27. 
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S0031:2 

S0031: 3

S0051: 1 and S0031: 1.

S0031: 2

L0756: 2, S0388: 1,
L0776: 1 and S0031: 1.

L0777: 4, S0031: 2,
H0051: 1, L0667: 1,
L0805: 1, L0659: 1 and
L0756: 1.

S0031: 3

S0031: 2 

Arg-48 to Arg-55, 
Pro-66 to His-84, 
Arg-112 to Gly-117.

Lys-24 to Ser-34, 
Lys-58 to Leu-64. 

Ala-1 to Gln-24. 


Leu-6 to Arg-11, 
Leu-54 to Ser-62, 
Ser-68 to Pro-80.

Lys-1 to Thr-9, 
Lys-34 to Phe-43, 
Arg-49 to Thr-62, 
Thr-71 to Gly-78. 


Pro-17 to Pro-26, 
Leu-47 to Thr-61.
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80031:2 

S0282: 1 and S0031: 1.

S0031: 2

S0386: 1 and S0260: 1.

S0260: 2

S0051: 1 and S0260: 1.

AR089: 18, AR061: 10
S0031: 1 and S0260: 1.

S0051: 1, L0777: 1 and
S0260: 1.

S0300: 1, L0754: 1,
L0750: 1 and S0260: 1.
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S0260: 2 

S0260: 2 

Arg-1 to Asn-14,
Gln-21 to Glu-26,
Arg-46 to Arg-61.

Thr-5 to Trp-13,
Arg-45 to Thr-52.

His-1 to Asp-13.

Asp-1 to Asp-6,
Pro-17 to Ser-35,
Gln-49 to Gly-57, 
Ala-59 to Arg-74, 


Pro-29 to Phe-34, 
Thr-84 to Asp-105. 

Glu-33 to Ala-47, 
Ala-51 to Gly-60, 
Pro-71 to Trp-77, 
Trp-84 to Glu-90. 


Glu-36 to Lys-50, 
Asn-57 to Pro-62. 



Glu-6 to Ser-19, 
Pro-22 to Asp-29.

Asn-2 to Cys-10,
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AR061: 4,AR089: 2 
S0050: 1 and S0260: 1.

S0260: 2

AR089: 0, AR061: 0
S0260: 2

S6024: 1 and S0260: 1.

H0051: 1 and S0260: 1.

AR061: 1, AR089: 0
S0260: 2

S0031: 1 and S0260: 1.

S0031: 1 and S0260: 1.

H0052: 2, L0438: 1
and S0260: 1.

S0222: 1, S0388: 1 and 
S0260: 1. 

S6024: 1, L0756: 1 and 
S0260: 1. 

S0010: 1 and S0260: 1.

S0346: 1 and S0260: 1.

L0439: 2, L0424: 1,
S0222: 1, L0805: 1,
S0260: 1 and L0366: 1.

S0260: 2

S0036: 1 and S0260: 1.

Glu-13 to Gly-20,
Arg-38 to Met-51.



Glu-89 to Lys-94. 

Leu-11 to Met-16.

Ser-4 to Tyr-16,
Leu-20 to Cys-27.

Lys-1 to Gly-9,
Gly-41 to Gln-49,
Leu-51 to Gly-56. 

Ala-2 to Asn-32. 

Arg-20 to Asn-31. 
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L0438: 2, L0439: 2, 
H0064: 1 and S0260: 1.

H0009: 2 and H0400: 
1. 

H0400: 2 

H0409: 2 

H0409: 2 

H0442: 2 and L0471: 1.

H0455: 2, L0769: 2,
L0564: 1 and L0763: 1.

H0455: 1, T0082: 1,
L0109: 1, L0770: 1,
L0634: 1, L0748: 1,
L0747: 1, L0749: 1 and
L0752: 1.

H0455: 1 and S0036: 1.

H0455: 1 and H0052: 1.

L0361: 2, S0036: 1 and
S0031: 1.

S0036: 2

L0439: 6, S0222: 1,
S0036: 1 and L0438: 1.


Asn-62 to Asn-67.

Lys-1 to Tyr-7,
His-32 to Tyr-38.


Ser-6 to Gln-11.

Ser-19 to Gln-24.

Arg-13 to Gln-19.

Arg-1 to Glu-13,
Arg-16 to Asn-25.

Ser-4 to Ser-10.

Pro-20 to Tyr-35. 



Glu-32 to Asn-39, 
Arg-45 to Gln-54. 
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S0036: 3 

S0036: 2 

AR051: 736, AR054: 534,
AR050: 481
S0036: 3

S0036: 2

L0750: 2, L0759: 2,
S0010: 1, S0036: 1,
L0803: 1 and L0779: 1.

S0036: 3 

S0036: 2 

S0010: 1 and S0036: 1.

S0049: 1 and S0036: 1. 

S0036: 2 



Ser-52 to Trp-61,
Pro-66 to Asn-76.

Leu-7 to Pro-17,
Gly-37 to Glu-44,
His-89 to Gly-95.

Pro-25 to Tyr-40.

Val-10 to Val-24.
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S0036:2 

S0282: 1 and S0036: 1. 

AR054: 10, AR061: 4,
AR050: 2, AR089: 1
S0038: 2, S0036: 1 and
L0681: 1.

AR051: 6, AR061: 1,
AR089: 0
S0036: 2

L0157: 2, S0036: 2,
S0110: 1, H0406: 1,
L0770: 1 and L0794: 1.

S0282: 1 and S0036: 1. 

S0036: 1 and S0386: 1. 

S6024: 1 and S0036: 1. 

H0052: 1 and S0036: 1. 

S0036: 2, L0521: 1 and
L0599: 1. 

S0036: 2 

S0036: 2 

Glu-26 to Trp-34, 
Thr-36 to Leu-42. 


Asp-20 to Leu-26, 
Pro-55 to Gln-63, 
Glu-120 to Pro-130,
Tyr-160 to Arg-181.
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Leu- 11 toTrp-16, 
Pro-47 to Gly-59. 


His-1 to Pro-16,
Thr-40 to Cys-47.


Lys-1 to Thr-6,
Lys-17 to Lys-23,
Glu-31 to Asn-36,
Gly-51 to Lys-56,
Pro-90 to Lys-96.


Gln-1 to Arg-10.

Pro-24 to Cys-29,
Gln-40 to Glu-49.
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S0036: 2 

H0009: 1 and S0036: 1.

S0010: 1, S0346: 1 and
S0036: 1.

S0036: 2

L0439: 5, L0438: 4,
H0123: 1, T0010: 1 and
S0036: 1.

S0036: 2

S0036: 2

S0036: 2

S0036: 2

S0036: 2 and L0773: 1.

L0438: 2, L0439: 2, 
S6028: 1, S0036: 1 and
L0592: 1. 

S0036: 2 

S0036: 1, S0386: 1 and 
L0366: 1. 

H0051: 1, S0036: 1 and
L0756: 1.

H0052: 1, S0036: 1,
L0764: 1, L0776: 1 and
L0657: 1.

S0222: 1 and S0036: 1.



Phe-3 to Glu-12. 




Ser-8 to Asn-22. 

Lys-5 to Trp-10.

Tyr-1 to Ser-11.
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Lys-30 to Gln-35. 

Pro-9 to Thr-16,
His-28 to Arg-50,
Ile-79 to Trp-89,
Pro-92 to Gly-104,
Leu-116 to Lys-123.
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Pro-7 to His-14. 
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S0036: 2 

H0052: 1 and S0036: 1.

AR089: 4, AR061: 3
S0222: 2, H0572: 1,
S0036: 1, L0741: 1,
L0742: 1 and L0780: 1.

H0009: 1 and S0036: 1.

S0282: 1 and S0036: 1.

S0036: 2

AR089: 4, AR061: 3
T0010: 3, S0049: 2,
H0052: 2, L0415: 1,
H0618: 1 and S0010: 1.

Arg-44 to Ser-51,
Glu-53 to Glu-59. 


Phe-14 to Gly-27, 
Thr-56 to Cys-80, 
His-82 to Pro-90, 
Glu-141 to His-147.

Ala-11 to Gly-20.

Thr-26 to Leu-35, 
Pro-39 to Ala-44. 

Pro-25 to Thr-30, 
Thr-41 to Lys-46, 
Gly-55 to Ser-61.

Gly-2 to Thr-10,
Glu-99 to Gly-104.
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1 L0748: 2 and S0036: 1. 

S0036: 1 and L0777: 1. 

S0036: 1 and L0361: 1.

S0036: 1

S0036: 1 and L0748: 1.

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

L0769: 2, L0622: 1, 
S0036: 1, L0770: 1 and
L0803: 1.

S0036: 1

S0036: 1 and L0748: 1.

L0604: 3, L0766: 2,
S0036: 1, L0772: 1 and
L0774: 1.

S0036: 1

S0036: 1

S0036: 1 and L0748: 1.

S0036: 1 

Thr-29 to Trp-45. 


Glu-5 to Pro-13, 
Thr-29 to Met-35. 

Pro-1 to Asp-11. 


Thr-1 to Lys-8. 

Arg-3 to Glu-10, 
Arg-28 to Pro-36. 



Pro-25 to Lys-38, 
His-50 to Gly-55. 

Gln-1 to Asp-14. 


Asn-42 to Gly-47. 

Trp-62 to Lys-68.




Ser-1 to Arg-11.
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683594 

924746 

572958 

787016 

724556 

935859 

574531 

792899 

574533


HSXEC47 

HSXDG96 

HSXDA85 

HSXCZ43 

HSXCZ21 

HSXCW61 

HSXCV53 

HSXCV44 

HSXCV41 

HSXCT69 
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109543, 
600631, 
601499 






192340, 
234200 
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20pl3-pl2.1 





S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 

S0036: 1 and L0594: 1.

L0623: 1 and S0036: 1.

L0756: 2 and S0036: 1.

S0036: 1

S0036: 1 and L0766: 1.

L0777: 2, S0036: 1 and
L0756: 1.

S0036: 1 and L0756: 1.

L0439: 2 and S0036: 1.

S0036: 1

L0754: 2 and S0036: 1.

L0770: 5, S0036: 1 and 
L0764: 1. 


Ile-54 to Ala-59.

Trp-1 to Gly-10,
Arg-15 to Pro-24,
Leu-39 to Phe-50.

Lys-29 to Phe-37.

Lys-1 to Ser-9.

Arg-1 to Asp-11,
Leu-16 to Pro-24. 





Thr-1 to Asn-9. 


Asn-1 to Leu-8. 

Val-7 to Ser-14. 

Met-8 to Pro-13, 
Pro-38 to Trp-44, 
Leu-85 to Thr-90. 

Ala-36 to Pro-49, 
Ala-65 to Glu-72. 

Pro-26 to Pro-31.
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147781, 
172471, 
182381 













16p11.2





12p11







S0036: 1 

L0766: 2 and S0036: 1. 

L0754: 4 and S0036: 1. 

S0036: 1, L0809: 1 and 
L0740: 1. 

H0455: 1 

L0439: 2, H0455: 1 
and L0438: 1.

H0455: 1 and L0744: 1.

L0754: 4, L0438: 3,
L0777: 3, H0455: 1,
L0439: 1 and L0731: 1.

H0455: 1 and L0439: 1.

H0455: 1, L0439: 1
and L0592: 1.

H0455: 1 and L0758: 1.

L0748: 2, H0409: 1
and L0749: 1.

H0409: 1 


Pro-1 to Arg-6, 
Asn-11 to Thr-16,
Glu-65 to Phe-71,
Tyr-117 to Gly-127.


Arg-18 to Trp-31.

Ser-1 to Arg-12,
Gln-27 to Gly-37,
Ala-41 to Ser-47. 

Ser-32 to Gln-38. 




Asn-1 to Asp-6, 
Asn-41 to Tyr-46. 

Ala-17 to Arg-24,
Tyr-32 to Asp-38, 
Val-79 to Cys-86, 

Pro-20 to Phe-27. 

Phe-1 to Leu-6, 
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894180 
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HSXAW11

HSXAJ56 
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HSDZQ65 
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HSDZO24
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1q23


H0400: 1, L0438: 1
and L0439: 1.

L0439: 2 and S0260: 1.

L0748: 1 and S0260: 1.

AR061: 2, AR089: 1
S0260: 1

AR089: 15, AR061: 6
S0260: 1 and L0581: 1.

Cys-87 to Ser-94. 


Trp-2 to Ser-11,
Gly-24 to Ser-37.

Leu-10 to His-22,
Lys-36 to Lys-41.

Asp-65 to Ala-70. 

Asn-42 to Gly-47, 
Lys-55 to Ala-62. 
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S0260: 1 

L0803: 1, L0438: 1,
L0439: 1, L0750: 1,
L0777: 1 and S0260: 1.

L0766: 1 and S0260: 1.


L0745: 2, L0766: 1, 
L0659: 1, L0666: 1 and
S0260: 1. 

S0260: 1 

L0748: 1, L0745: 1 and 
S0260: 1. 

S0260: 1 and L0608: 1.

L0779: 1 and S0260: 1.

L0748: 1 and S0260: 1.

L0776: 3, L0005: 2, 
L0809: 2, L0626: 1, 
L0777: 1 and S0260: 1. 

S0260: 1



Pro-4 to Gly-11,
Ala-19 to Leu-25,
Gln-93 to Gly-103. 

Lys-8 to Ser-14, 
Pro-28 to Ser-33, 
Ser-51 to Ala-57,
Thr-84 to Pro-97,
Lys-119 to Pro-130,
Asn-134 to Arg-147.

Val-64 to Ala-71. 

Gln-8 to Val-20. 




Lys-14 to Ala-23, 
Pro-25 to Arg-33. 


Thr-31 to Pro-37.


Asp-19 to Thr-26. 
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AR051: 2, AR050: 2,
AR054: 1 
S0260: 1 

L0769: 1 and S0260: 1. 

L0438: 4, L0439: 3 and 
S0260: 1. 

S0031: 1 and L0605: 1.

L0598: 1, L0744: 1,
L0748: 1 and S0031: 1.

S0031: 1

L0439: 2 and S0031: 1.

L0748: 1 and S0031: 1.

AR051: 41, AR054:
28, AR050: 27
S0031: 1

L0773: 1, L0766: 1,

His-8 to Gly-18, 
Asp-129 to Leu-134,
Gly-139 to Gly-146.

Asp-106 to Leu-111,
Gly-116 to Gly-123.

Arg-1 to Ser-6,
Ser-41 to Cys-58, 
Gly-68 to Ser-75. 

Ser-1 to Gly-7, 
Arg-42 to Gln-50. 

Gly-3 to Trp-10, 
Glu-22 to Gln-30, 
Ala-51 to Gln-56.


Ala-21 to Met-26,
Ser-36 to Gly-41.



Gly-1 to Trp-9, 
Phe-13 to Lys-25, 
Lys-61 to Lys-66.


Thr-1 to Asn-8.

Cys-3 to Pro-15, 
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L0783: 1, L0809: 1,
L0745: 1 and S0031: 1.

S0031: 1 

S0031: 1 

S0031: 1 

AR061: 6, AR089: 3
L0618: 1, L0770: 1,
L0803: 1 and S0031: 1.

L0745: 1 and S0031: 1.

S0031: 1 and L0592: 1.

L0542: 1 and S0031: 1.

L0439: 3, L0747: 3,
L0742: 1, L0731: 1 and
S0031: 1.

S0031: 1

L0439: 4 and S0031: 1.

Pro-17 to Leu-25,
Pro-50 to Asp-60,
Lys-74 to Lys-79.

Asp-1 to Ser-20,
Glu-42 to Arg-47, 
Gln-55 to Lys-61, 
Leu-65 to Gly-76. 

Glu-3 to Asp-8,
Ser-27 to Leu-35, 
Pro-65 to Ser-73. 

Arg-26 to Val-33,
Ala-46 to Gly-60. 

Ala-155 to Glu-160.

Gly-1 to Leu-9, 
Pro-13 to Glu-20, 
Leu-36 to Ser-42.

Asp-23 to Arg-30,
His-39 to Thr-46,
Gly-96 to Pro-109.


Gly-1 to Asn-8. 

His-8 to Gly-18,
Pro-81 to Thr-88.

Tyr-40 to Arg-49. 
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L0754: 1 and S0031: 1.

AR051: 20, AR054:
14, AR050: 11
S0031: 1

AR050: 6, AR051: 2,
AR054: 1 
S0031: 1 

S0031: 1 

S0031: 1 

L0439: 1 and S0031: 1.

S0031: 1

L0748: 1 and S0031: 1.

L0439: 2, L0756: 1 and
S0031: 1.

L0439: 2, L0766: 1,
L0740: 1, L0777: 1 and
S0031: 1. 

Lys-31 to Leu-43.

Arg-16 to Leu-21. 

Thr-56 to Arg-71.

Gln-44 to Thr-52. 

Lys-47 to Thr-52. 


Ser-7 to Gly-15, 
Ala-21 to Thr-38. 

Ala-30 to Pro-35, 
Ala-44 to Asn-49. 

Tyr-21 to Tyr-28. 
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HSDFB32 
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HSDEV19 

HSDEU27 

HSDEU06 

HSDER30 

HSDEH43 

HSDEH39 
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